Gene EcE24377A_1916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1916 
Symbol 
ID5586089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1907216 
End bp1908505 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content54% 
IMG OID640925591 
Producthypothetical protein 
Protein accessionYP_001462994 
Protein GI157154713 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGATG ACAAATTTGA TGCCATTGTG GTCGGTGCGG GCGTTGCTGG TAGCGTTGCC 
GCACTGGTCA TGGCGCGAGC CGGGCTGGAT GTCCTGGTGA TAGAACGCGG CGACAGTGCC
GGATGTAAAA ACATGACCGG CGGGCGTCTT TATGCCCACA CACTTGAAGC AATCATTCCA
GGCTTTGCAG TATCAGCGCC GGTAGAACGC AAGGTCACAC GCGAGAAAAT CTCCTTCTTA
ACCGAAGAAA GCGCCGTTAC CCTCGATTTT CACCGCGAGC AACCAGATGT TCCGCAACAC
GCATCTTATA CCGTATTGCG TAATCGTCTG GACCCGTGGT TGATGGAACA AGCCGAGCAG
GCGGGCGCAC AGTTTATCCC GGGCGTTCGC GTCGATGCGT TGGTTCGTGA AGGAAACAAG
GTCACTGGCG TGCAGGCCGG GGATGATATT CTCCAAGCGA ATGTGGTGGT TCTAGCTGAT
GGCGTTAACT CGATGCTTGG CCGCTCGCTG GGAATGGTTC CCGCTTCCGA TCCGCATCAT
TACGCTGTTG GTGTTAAAGA GGTTATTGGC CTCACACCAG AACAGATCAA CGATCGCTTT
AATATTACGG GCGAGGAAGG TGCCGCCTGG CTGTTTGCCG GTTCCCCTTC TGACGGCCTG
ATGGGCGGGG GATTTCTCTA TACCAACAAG GATTCCATAT CCTTGGGGCT GGTTTGTGGA
TTGGGTGATA TCGCCCATGC GCAAAAAAGC GTGCCGCAAA TGCTGGAAGA TTTTAAACAA
CACCCCGCCA TTCGCCCACT AATTAGCGGC GGCAAACTGC TTGAATATTC CGCGCATATG
GTGCCAGAAG GCGGTCTGGC AATGGTGCCG CAGATGGTTA ACGATGGCGT GATGATCGTT
GGTGACGCCG CAGGCTTCTG CCTGAATTTG GGTTTTACAG TTCGTGGCAT GGATTTAGCC
ATTGCATCGG CTCAGGCTGC CGCCACAACG GTGATCGCCG CCAAAGAACG CGAGGATTTC
TCCGCCAGCA GTCTGGCGCA ATACAAACGT GAGCTGGAAC AAAGCTGCGT CATGCGCGAT
ATGCAGCATT TCCGCAAGAT CCCGGCGCTG ATGGAAAACC CGCGCCTGTT TAGCCAATAC
CCACGAATGG TCGCCGACAT CATGAACGAG ATGTTCACCA TTGACGGCAA ACCAAACCAG
CCGGTACGCA AAATGATCAT GGGACATGCG AAGAAAATTG GGCTGATCAA CTTGCTGAAA
GATGGCATTA AGGGAGCAAC CGCGCTATGA
 
Protein sequence
MSDDKFDAIV VGAGVAGSVA ALVMARAGLD VLVIERGDSA GCKNMTGGRL YAHTLEAIIP 
GFAVSAPVER KVTREKISFL TEESAVTLDF HREQPDVPQH ASYTVLRNRL DPWLMEQAEQ
AGAQFIPGVR VDALVREGNK VTGVQAGDDI LQANVVVLAD GVNSMLGRSL GMVPASDPHH
YAVGVKEVIG LTPEQINDRF NITGEEGAAW LFAGSPSDGL MGGGFLYTNK DSISLGLVCG
LGDIAHAQKS VPQMLEDFKQ HPAIRPLISG GKLLEYSAHM VPEGGLAMVP QMVNDGVMIV
GDAAGFCLNL GFTVRGMDLA IASAQAAATT VIAAKEREDF SASSLAQYKR ELEQSCVMRD
MQHFRKIPAL MENPRLFSQY PRMVADIMNE MFTIDGKPNQ PVRKMIMGHA KKIGLINLLK
DGIKGATAL