Gene EcolC_1932 details

Gene Information

Locus tag: EcolC_1932
Symbol:
ID: 6068607
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2134005
End bp: 2135294
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 54%
IMG OID: 641601343
Product: hypothetical protein
Protein accession: YP_001724905
Protein GI: 170019951
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
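
The start, end, gene length and protein length fields above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted as a residue. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 2134005
end_bp = 2135294

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1290 429
```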


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0346212
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGATG ACAAATTTGA TGCCATTGTG GTCGGTGCGG GCGTTGCTGG TAGCGTTGCC 
GCACTGGTCA TGGCACGAGC CGGGCTGGAT GTCCTGGTGA TAGAACGCGG CGACAGTGCC
GGATGTAAAA ACATGACCGG CGGGCGCCTT TATGCCCACA CACTTGAAGC AATCATTCCA
GGCTTTGCAG CATCAGCGCC GGTAGAACGC AAGGTCACAC GCGAGAAAAT CTCCTTCTTA
ACCGAAGAGA GCGCCGTTAC CCTCGATTTT CACCGCGAGC AACCAGATGT TCCGCAACAC
GCATCTTATA CCGTATTGCG TAATCGTCTG GACCCGTGGT TGATGGAACA TGCCGAGCAG
GCGGGCGCAC AGTTTATCCC GGGAGTTCGC GTCGATGCGC TGGTTCGTGA AGGAAACAAG
GTCACTGGCG TCCAGGCTGG GGATGATATT CTCGAAACGA ATGTGGTGAT TCTGGCTGAT
GGCGTTAACT CGATGCTTGG CCGCTCGCTG GGAATGGTTC CCGCTTCCGA TCCGCATCAT
TACGCTGTTG GTGTTAAAGA GGTTATTGGC CTCACACCAG AACAGATCAA CGATCGCTTT
AATATTACGG GCGAGGAAGG TGCCGCCTGG CTGTTTGCCG GTTCCCCTTC TGACGGCCTG
ATGGGTGGCG GATTCCTCTA TACCAATAAG GATTCCATAT CGTTGGGGCT GGTTTGTGGA
TTGGGTGATA TCGCCCATGC GCAAAAAAGC GTGCCGCAAA TGCTGGAAGA TTTTAAACAA
CACCCCGCCA TTCGCCCGCT GATTAGCGGC GGCAAACTGC TTGAATATTC CGCGCATATG
GTGCCGGAAG GCGGTCTGGC AATGGTGCCG CAACTGGTTA ACGAGGGCGT GATGATCGTT
GGTGACGCCG CAGGCTTCTG CCTGAATTTG GGTTTTACGG TCCGCGGCAT GGATTTAGCC
ATTGCATCGG CTCAAGCTGC CGCCACAACG GTGATCGCCG CCAAAGAACG CGCGGATTTC
TCCGCCAGCA GTCTTGCGCA ATACAAACGT GAGCTGGAAC AAAGCTGCGT CATGCGTGAT
ATGCAGCATT TTCGCAAGAT CCCGGCGCTG ATGGAAAACC CGCGCCTGTT TAGCCAATAC
CCACGAATGG TAGCCGACAT CATGAACGAG ATGTTCACCA TTGACGGCAA ACCAAACCAG
CCGGTACGCA AAATGATCAT GGGACATGCG AAGAAAATCG GGCTGATCAA CTTGCTGAAA
GATGGCATTA AGGGAGCAAC CGCGCTATGA
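
A GC content figure such as the one in the record is usually the share of G and C bases in the sequence. The sketch below shows one way to compute it from a sequence laid out in whitespace-separated blocks like the one above. The function name `gc_content` is hypothetical, and whether the database computes or rounds its value this way is an assumption.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to lay the sequence out in blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence above: 5 of its 10 bases are G or C.
print(gc_content("ATGTCGGATG"))  # prints: 50.0
```

Passing the full gene sequence, spaces and line breaks included, gives the percentage for the whole gene.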
 
Protein sequence
MSDDKFDAIV VGAGVAGSVA ALVMARAGLD VLVIERGDSA GCKNMTGGRL YAHTLEAIIP 
GFAASAPVER KVTREKISFL TEESAVTLDF HREQPDVPQH ASYTVLRNRL DPWLMEHAEQ
AGAQFIPGVR VDALVREGNK VTGVQAGDDI LETNVVILAD GVNSMLGRSL GMVPASDPHH
YAVGVKEVIG LTPEQINDRF NITGEEGAAW LFAGSPSDGL MGGGFLYTNK DSISLGLVCG
LGDIAHAQKS VPQMLEDFKQ HPAIRPLISG GKLLEYSAHM VPEGGLAMVP QLVNEGVMIV
GDAAGFCLNL GFTVRGMDLA IASAQAAATT VIAAKERADF SASSLAQYKR ELEQSCVMRD
MQHFRKIPAL MENPRLFSQY PRMVADIMNE MFTIDGKPNQ PVRKMIMGHA KKIGLINLLK
DGIKGATAL
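
The protein sequence can be checked against the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the start codons it permits, so the sketch below uses the standard assignments. The function name `translate` is hypothetical, and the code assumes an unambiguous A/C/G/T sequence read in frame from its first base.

```python
from itertools import product

# Standard codon assignments in the compact T, C, A, G ordering;
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to lay the sequence out in blocks.
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        # Raises KeyError on ambiguous bases such as N (not handled in this sketch).
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First seven codons of the gene sequence above.
print(translate("ATGTCGGATG ACAAATTTGA T"))  # prints: MSDDKFD
```

Translating the last line of the gene sequence, `GATGGCATTA AGGGAGCAAC CGCGCTATGA`, yields `DGIKGATAL`, which matches the end of the protein sequence.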