Gene EcDH1_2872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2872 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3077799 
End bp3079232 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content50% 
IMG OID 
Productanion transporter 
Protein accessionACX40505 
Protein GI260450083 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.675436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAGA AATCGTTATG GAAGCTAATT CTGATATTAG CGATCCCATG TATTATTGGT 
TTTATGCCAG CTCCGGCAGG ATTAAGCGAA CTGGCGTGGG TGCTTTTTGG TATTTACCTG
GCGGCCATTG TGGGGCTGGT TATCAAGCCT TTCCCGGAAC CTGTCGTACT GTTAATTGCC
GTTGCTGCCT CAATGGTGGT GGTCGGTAAC TTATCCGACG GTGCGTTTAA AACCACCGCC
GTATTAAGCG GTTACTCTTC AGGTACCACC TGGCTGGTGT TCTCGGCGTT TACCTTAAGC
GCCGCATTTG TGACCACCGG TTTAGGTAAA CGTATTGCCT ATCTGCTGAT TGGTAAAATC
GGTAACACCA CGCTGGGTCT GGGTTACGTT ACGGTATTCC TCGATCTGGT ACTGGCTCCG
GCAACACCGT CTAACACCGC GCGTGCGGGC GGCATTGTGT TACCGATCAT CAACAGCGTG
GCGGTGGCTT TGGGGTCCGA ACCGGAAAAA AGTCCGCGTC GTGTCGGACA TTACCTGATG
ATGTCCATTT ACATGGTCAC CAAAACCACC AGCTATATGT TCTTTACCGC AATGGCGGGG
AACATTCTGG CGCTGAAAAT GATCAACGAC ATTCTGCACC TGCAAATTAG CTGGGGTGGA
TGGGCGCTGG CAGCCGGATT GCCGGGCATC ATTATGCTGC TGGTCACCCC GCTGGTGATT
TACACCATGT ATCCACCAGA AATTAAGAAG GTGGATAACA AAACCATCGC TAAAGCGGGC
CTTGCCGAAC TAGGACCGAT GAAAATCCGC GAAAAAATGC TGCTCGGTGT CTTTGTGCTG
GCGCTGCTGG GCTGGATTTT CAGTAAGTCT CTGGGGGTTG ATGAATCCAC CGTGGCAATC
GTTGTTATGG CAACCATGCT GCTGCTGGGT ATCGTTACCT GGGAAGACGT GGTTAAAAAT
AAAGGCGGCT GGAATACCTT AATCTGGTAC GGCGGTATTA TCGGCTTAAG CTCCTTATTA
TCGAAAGTTA AATTCTTCGA ATGGTTAGCT GAAGTCTTTA AAAATAACCT GGCATTTGAT
GGTCACGGTA ACGTTGCTTT CTTCGTTATT ATTTTCCTCA GCATTATCGT GCGTTATTTC
TTCGCTTCCG GTAGTGCCTA TATCGTTGCT ATGTTACCGG TATTTGCCAT GCTGGCGAAC
GTCTCCGGCG CACCGTTAAT GTTAACCGCG CTGGCACTGT TGTTCTCCAA CTCCTATGGC
GGCATGGTTA CTCACTATGG CGGCGCGGCA GGTCCGGTCA TCTTTGGCGT GGGTTATAAC
GATATTAAAT CCTGGTGGTT GGTCGGTGCG GTACTGACGA TATTAACCTT CCTGGTGCAT
ATCACCCTCG GCGTGTGGTG GTGGAATATG CTGATCGGCT GGAACATGCT GTAA
 
Protein sequence
MNKKSLWKLI LILAIPCIIG FMPAPAGLSE LAWVLFGIYL AAIVGLVIKP FPEPVVLLIA 
VAASMVVVGN LSDGAFKTTA VLSGYSSGTT WLVFSAFTLS AAFVTTGLGK RIAYLLIGKI
GNTTLGLGYV TVFLDLVLAP ATPSNTARAG GIVLPIINSV AVALGSEPEK SPRRVGHYLM
MSIYMVTKTT SYMFFTAMAG NILALKMIND ILHLQISWGG WALAAGLPGI IMLLVTPLVI
YTMYPPEIKK VDNKTIAKAG LAELGPMKIR EKMLLGVFVL ALLGWIFSKS LGVDESTVAI
VVMATMLLLG IVTWEDVVKN KGGWNTLIWY GGIIGLSSLL SKVKFFEWLA EVFKNNLAFD
GHGNVAFFVI IFLSIIVRYF FASGSAYIVA MLPVFAMLAN VSGAPLMLTA LALLFSNSYG
GMVTHYGGAA GPVIFGVGYN DIKSWWLVGA VLTILTFLVH ITLGVWWWNM LIGWNML