Gene EcHS_A3902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3902 
Symbol 
ID5592377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3896548 
End bp3897648 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content52% 
IMG OID640923010 
Productputative oxidoreductase 
Protein accessionYP_001460487 
Protein GI157163169 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones77 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACATT TCGACGTGGC GATTATTGGC CTCGGCCCGG CAGGGTCGGC GTTGGCACGA 
AAGTTAGCCG GCAAAATGCA GGTGATCGCG CTGGATAAAA AGCACCAGTG TGGTACTGAA
GGTTTCAGCA AACCTTGTGG CGGTCTGCTG GCACCGGACG CGCAGCGTTC TTTTATTCGC
GATGGACTGA CGCTCCCTGT CGATGTGATC GCCAATCCGC AGATTTTCAG CGTCAAAACC
GTCGACGTCG CCGCATCGCT CACACGTAAC TACCAGCGAA GCTATATCAA TATTAATCGC
CACGCTTTCG ACTTGTGGAT GAAATCACTG ATCCCCGCCA GCGTTGAGGT TTACCACGAT
AGCCTGTGCC GGAAAATCTG GCGTGAGGAT GATAAATGGC ATGTCATTTT TCGTGCAGAC
GGTTGGGAGC AGCATATTTC CGCCCGCTAT CTGGTCGGTG CCGATGGTGC CAACTCGATG
GTGCGGCGAC ATCTCTACCC GGATCATCAA ATCCGTAAAT ATGTCGCTAT CCAGCAGTGG
TTTGCAGAGA AACATCCGGT ACCGTTCTAC TCCTGCATCT TTGATAATGA AATAACTGAC
TGTTATTCAT GGAGTATCAG CAAAGACGGT TATTTTATCT TTGGCGGTGC TTATCCAATG
AAAGACGGTC AGACGCGTTT CACGACGCTG AAAGAGAAAA TGAGCGCCTT TCAGTTCCAG
TTTGGTAAGG CGGTGAAAAG CGAAAAATGC ACGGTGCTGT TTCCCTCGCG CTGGCAGGAT
TTTGTCTGCG GTAAGGACAA CGCCTTTCTG ATTGGCGAAG CGGCAGGATT TATCAGCGCC
AGCTCGCTGG AGGGGATTAG CTATGCGCTG GATAGCGCAG AGATTCTGCG TGCGGTGTTA
CTGAAGCAGC CGGAGAAGAG CAACGCCGCC TACTGGCGCG CCACCCGCAA ACTGCGTTTA
AAACTCTTCG GCAAGATAGT AAAAAGCCGA TGCCTGACCG CACCGGCTTT AAGAAAGTGG
ATTATGCGCA GTGGTGTGGC GCATATTCCA CAGTTGAAAG ATTATCCAAC GCGCTTCACA
TCGCCCACCA GCAGGATGTA A
 
Protein sequence
MEHFDVAIIG LGPAGSALAR KLAGKMQVIA LDKKHQCGTE GFSKPCGGLL APDAQRSFIR 
DGLTLPVDVI ANPQIFSVKT VDVAASLTRN YQRSYININR HAFDLWMKSL IPASVEVYHD
SLCRKIWRED DKWHVIFRAD GWEQHISARY LVGADGANSM VRRHLYPDHQ IRKYVAIQQW
FAEKHPVPFY SCIFDNEITD CYSWSISKDG YFIFGGAYPM KDGQTRFTTL KEKMSAFQFQ
FGKAVKSEKC TVLFPSRWQD FVCGKDNAFL IGEAAGFISA SSLEGISYAL DSAEILRAVL
LKQPEKSNAA YWRATRKLRL KLFGKIVKSR CLTAPALRKW IMRSGVAHIP QLKDYPTRFT
SPTSRM