Gene EcolC_1999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1999 
SymbolrnfD 
ID6068135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2206475 
End bp2207533 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content53% 
IMG OID641601413 
Productelectron transport complex protein RnfD 
Protein accessionYP_001724972 
Protein GI170020018 
COG category[C] Energy production and conversion 
COG ID[COG4658] Predicted NADH:ubiquinone oxidoreductase, subunit RnfD 
TIGRFAM ID[TIGR01946] electron transport complex, RnfABCDGE type, D subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00297126 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.240124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTATTCA GAATAGCTAG CTCCCCTTAT ACCCATAACC AGCGCCAGAC ATCGCGCATT 
ATGCTGTTGG TGTTGCTCGC AGCCGTGCCA GGAATCGCAG CGCAACTGTG GTTTTTTGGT
TGGGGTACTC TCGTTCAGAT CCTGTTGGCA TCGGTTAGTG CTCTGTTAGC CGAAGCTCTC
GTACTCAAAC TACGCAAGCA GTCGGTAGCC GCAACGTTGA AAGATAACTC AGCATTGCTG
ACAGGCTTAT TGCTGGCGGT AAGTATTCCC CCCCTCGCGC CATGGTGGAT GGTCGTGCTG
GGTACGGTGT TTGCGGCGAT CATCGCTAAA CAGTTGTATG GCGGTCTGGG GCAAAACCCG
TTTAATCCGG CAATGATTGG TTATGTGGTC TTACTGATCT CCTTCCCTGT GCAGATGACC
AGCTGGTTAC CGCCACATGA AATTGCGGTC AACATCCCTG GTTTTATCGA CGCCATACAG
GTTATTTTCA GCGGACATAC CGCCAGTGGT GGTGATATGA ACACACTACG CTTAGGTATT
GATGGCATTA GTCAGGCGAC ACCGCTGGAT ACATTTAAAA CCTCTGTCCG TGCCGGTCAT
TCGGTTGAAC AGATTATGCA ATATCCGATC TACAGCGGTA TTCTGGCGGG CGCTGGTTGG
CAATGGGTAA ATCTCGCCTG GCTGGCTGGC GGCCTGTGGT TGCTATGGCA GAAAGCGATT
CGCTGGCATA TTCCCCTCAG CTTCTTAGTA ACGCTGGCGT TATGCGCAAC GTTGGGCTGG
TTGTTCTCAC CAGAAACACT GGCAGCACCG CAAATTCATC TGCTGTCTGG TGCGACCATG
CTCGGCGCAT TCTTTATTTT GACTGACCCG GTTACCGCTT CTACGACCAA TCGTGGTCGT
CTTATTTTCG GCGCGCTGGC GGGCTTATTA GTCTGGTTGA TCCGCAGTTT CGGCGGCTAT
CCTGACGGCG TGGCTTTTGC CGTCCTGCTG GCGAACATCA CGGTTCCTCT GATCGATTAC
TACACGCGTC CGCGCGTCTA CGGCCATCGC AAAGGGTAA
 
Protein sequence
MVFRIASSPY THNQRQTSRI MLLVLLAAVP GIAAQLWFFG WGTLVQILLA SVSALLAEAL 
VLKLRKQSVA ATLKDNSALL TGLLLAVSIP PLAPWWMVVL GTVFAAIIAK QLYGGLGQNP
FNPAMIGYVV LLISFPVQMT SWLPPHEIAV NIPGFIDAIQ VIFSGHTASG GDMNTLRLGI
DGISQATPLD TFKTSVRAGH SVEQIMQYPI YSGILAGAGW QWVNLAWLAG GLWLLWQKAI
RWHIPLSFLV TLALCATLGW LFSPETLAAP QIHLLSGATM LGAFFILTDP VTASTTNRGR
LIFGALAGLL VWLIRSFGGY PDGVAFAVLL ANITVPLIDY YTRPRVYGHR KG