Gene EcolC_2000 details

Gene Information

Locus tag: EcolC_2000
Symbol:
ID: 6068133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2207534
End bp: 2209756
Gene Length: 2223 bp
Protein Length: 740 aa
Translation table: 11
GC content: 55%
IMG OID: 641601414
Product: electron transport complex protein RnfC
Protein accession: YP_001724973
Protein GI: 170020019
COG category: [C] Energy production and conversion
COG ID: [COG4656] Predicted NADH:ubiquinone oxidoreductase, subunit RnfC
TIGRFAM ID: [TIGR01945] electron transport complex, RnfABCDGE type, C subunit
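
The coordinates and lengths in this record are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record: Start bp, End bp
length = gene_length_bp(2207534, 2209756)
print(length, protein_length_aa(length))  # 2223 740
```

The result matches the listed Gene Length (2223 bp) and Protein Length (740 aa).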


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000227688
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.287278
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTAAGT TATTCTCTGC ATTCAGAAAA AATAAAATCT GGGATTTCAA CGGCGGCATC 
CATCCACCGG AGATGAAAAC CCAGTCCAAC GGTACACCCC TGCGCCAGGT ACCCCTGGCG
CAGCGTTTTG TTATTCCACT GAAACAGCAT ATTGGCGCTG AAGGTGAGTT GTGCGTTAGC
GTCGGCGATA AAGTATTGCG CGGCCAGCCG CTTACCCGTG GTCGCGGCAA AATGCTGCCT
GTTCACGCGC CCACCTCGGG TACCGTTACG GCTATTGCGC CCCACTCTAC GGCTCATCCT
TCAGCTTTAG CTGAATTAAG CGTGATTATT GATGCCGATG GTGAAGACTG CTGGATCCCG
CGCGACGGCT GGGCCGATTA TCGCACTCGC AGTCGCGAAG AGTTAATCGA GCGCATACAT
CAGTTTGGTG TTGCCGGGCT GGGCGGTGCA GGATTCCCGA CAGGCGTTAA ATTGCAGGGT
GGCGGAGATA AGATTGAAAC GTTGATTATC AACGCGGCTG AGTGCGAGCC GTACATTACC
GCCGATGACC GTTTGATGCA GGATTGCGCG GCTCAGGTCG TAGAGGGTAT TCGCATTCTT
GCGCATATTC TGCAGCCACG CGAAATTCTT ATCGGCATTG AAGATAACAA ACCGCAGGCG
ATTTCCATGC TGCGCGCGGT GCTGGCGGAC TCTAACGATA TTTCTCTGCG GGTGATTCCA
ACCAAATATC CTTCTGGCGG TGCTAAACAA TTAACCTACA TTCTGACCGG GAAGCAGGTT
CCACATGGCG GGCGTTCATC CGATATCGGC GTATTAATGC AAAACGTCGG CACTGCTTAT
GCAGTGAAAC GTGCCGTTAT TGATGGCGAG CCGATTACCG AGCGTGTTGT AACCCTGACT
GGCGAAGCAA TCGCTCGCCC GGGCAACGTC TGGGCACGGC TGGGGACGCC AGTGCGTCAT
TTATTGAATG ATGCCGGATT CTGCCCCTCT GCCGATCAAA TGGTGATTAT GGGTGGCCCG
CTAATGGGCT TTACCTTGCC ATGGCTGGAT GTCCCGGTCG TAAAGATTAC CAACTGTCTG
TTGGCTCCCT CTGCCAATGA ACTTGGCGAA CCACAGGAAG AACAAAGCTG CATCCGGTGT
AGCGCCTGTG CTGACGCCTG CCCTGCGGAT CTTTTGCCGC AACAGTTGTA CTGGTTCAGC
AAAGGTCAGC AACACGATAA AGCTACCACG CATAACATTG CTGATTGCAT TGAATGTGGG
GCTTGCGCGT GGGTTTGCCC GAGCAATATT CCCCTGGTGC AATATTTCCG TCAGGAAAAA
GCTGAAATTG CGGCTATTCG TCAGGAAGAA AAGCGCGCCG CAGAAGCCAA AGCGCGTTTC
GAAGCGCGCC AGGCTCGTCT GGAGCGCGAA AAAGCGGCTC GCCTTGAACG ACATAAGAGC
GCAGCCGTTC AACCTGCAGC CAAAGATAAA GATGCGATTG CTGCCGCTCT GGCGCGGGTG
AAAGAGAAAC AGGCCCAGGC TACACAGCCT ATTGTGATTA AAGCGGGCGA ACGCCCGGAT
AACAGTGCAA TTATTGCAGC ACGGGAAGCC CGTAAAGCGC AAGCCAGAGC GAAACAGGCA
GAACTGCAGC AAACTAACGA CGCAGCAACC GTTGCTGATC CACGTAAAAC TGCCGTTGAA
GCAGCTATCG CCCGCGCCAA AGCGCGCAAG CTGGAACAGC AACAGGCTAA TGCGGAACCA
GAAGAACAGG TCGATCCGCG CAAAGCCGCC GTCGAAGCCG CTATTGCCCG TGCCAAAGCA
CGCAAGCTGG AACAGCAACA GGCTAATGCC GAGCCAGAAC AACAGGTCGA TCCGCGCAAA
GCCGCCGTCG AAGCCGCTAT TGCCCGTGCC AAAGCACGCA AGCTGGAACA GCAACAGGCT
AATGCCGAGC CAGAACAACA GGTCGATCCG CGCAAAGCCG CCGTCGAAGC CGCTATTGCC
CGAGCCAAAG CGCGCAAACG GGAACAGCAA CCGGCTAATG CGGAGCCAGA AGAACAGGTT
GATCCGCGCA AAGCTGCCGT CGAAGCGGCT ATTGCACGCG CCAAAGCACG CAAGCTGGAA
CAGCAACAGG CTAATGCGGT ACCAGAAGAA CAGGTTGATC CGCGCAAAGC GGCAGTTGCC
GCGGCTATTG CCCGCGCTCA GGCCAAAAAA GCCGCCCAGC AGAAGGTTGT AAACGAGGAC
TAA
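
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below strips the whitespace and computes a GC percentage of the kind reported in the record. The helper names are hypothetical, and the record does not state how its own GC content value was rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence; paste all lines to analyse the full gene.
first_line = "ATGCTTAAGT TATTCTCTGC ATTCAGAAAA AATAAAATCT GGGATTTCAA CGGCGGCATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```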
 
Protein sequence
MLKLFSAFRK NKIWDFNGGI HPPEMKTQSN GTPLRQVPLA QRFVIPLKQH IGAEGELCVS 
VGDKVLRGQP LTRGRGKMLP VHAPTSGTVT AIAPHSTAHP SALAELSVII DADGEDCWIP
RDGWADYRTR SREELIERIH QFGVAGLGGA GFPTGVKLQG GGDKIETLII NAAECEPYIT
ADDRLMQDCA AQVVEGIRIL AHILQPREIL IGIEDNKPQA ISMLRAVLAD SNDISLRVIP
TKYPSGGAKQ LTYILTGKQV PHGGRSSDIG VLMQNVGTAY AVKRAVIDGE PITERVVTLT
GEAIARPGNV WARLGTPVRH LLNDAGFCPS ADQMVIMGGP LMGFTLPWLD VPVVKITNCL
LAPSANELGE PQEEQSCIRC SACADACPAD LLPQQLYWFS KGQQHDKATT HNIADCIECG
ACAWVCPSNI PLVQYFRQEK AEIAAIRQEE KRAAEAKARF EARQARLERE KAARLERHKS
AAVQPAAKDK DAIAAALARV KEKQAQATQP IVIKAGERPD NSAIIAAREA RKAQARAKQA
ELQQTNDAAT VADPRKTAVE AAIARAKARK LEQQQANAEP EEQVDPRKAA VEAAIARAKA
RKLEQQQANA EPEQQVDPRK AAVEAAIARA KARKLEQQQA NAEPEQQVDP RKAAVEAAIA
RAKARKREQQ PANAEPEEQV DPRKAAVEAA IARAKARKLE QQQANAVPEE QVDPRKAAVA
AAIARAQAKK AAQQKVVNED
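
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. As a minimal sketch, the code below builds the codon table from the conventional TCAG ordering; table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits. The sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The `translate` function is illustrative, not a tool used by the source database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence
print(translate("ATGCTTAAGT TATTCTCTGC ATTCAGAAAA"))  # MLKLFSAFRK
```

The output matches the first ten residues of the protein sequence listed in this record.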