Gene EcDH1_0899 details


Gene Information

Locus tag: EcDH1_0899
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 961663
End bp: 963015
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: d-galactonate transporter
Protein accession: ACX38582
Protein GI: 260448160
COG category:
COG ID:
TIGRFAM ID:
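
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. The function names are hypothetical, and it assumes that start and end are inclusive 1-based positions and that the gene length includes a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(961663, 963015)
print(length, protein_length_aa(length))  # 1353 450
```

Under these assumptions the result agrees with the listed gene length (1353 bp) and protein length (450 aa).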


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTCTT TAAGTCAGGC TGCGAGCAGT GTGGAAAAAC GCACAAATGC TCGTTACTGG 
ATAGTGGTGA TGTTGTTTAT CGTCACATCC TTCAACTACG GCGACCGCGC TACGCTCTCT
ATCGCCGGTT CGGAAATGGC CAAAGATATC GGCCTTGATC CCGTGGGAAT GGGCTATGTG
TTCTCTGCTT TCTCATGGGC TTATGTTATC GGGCAGATCC CTGGTGGCTG GTTGCTGGAC
CGTTTTGGTT CAAAACGCGT CTACTTCTGG TCGATCTTTA TCTGGTCGAT GTTTACCTTG
CTGCAAGGCT TCGTCGATAT CTTTAGTGGA TTCGGCATTA TCGTTGCCCT GTTTACGCTG
CGCTTCCTGG TCGGGCTTGC TGAAGCGCCA TCTTTCCCCG GCAACAGTCG CATTGTTGCG
GCCTGGTTTC CGGCGCAGGA AAGGGGAACG GCGGTGTCGA TTTTTAACTC CGCTCAATAC
TTCGCAACGG TGATCTTCGC GCCGATTATG GGCTGGCTGA CGCATGAAGT GGGCTGGTCA
CACGTCTTCT TCTTTATGGG CGGTCTGGGG ATTGTCATCA GCTTTATCTG GTTGAAAGTC
ATCCACGAGC CAAATCAACA TCCGGGGGTA AATAAGAAAG AGCTGGAGTA CATCGCCGCG
GGTGGTGCGC TGATCAATAT GGATCAGCAA AACACCAAAG TTAAAGTGCC GTTCAGCGTG
AAGTGGGGGC AGATCAAACA GCTGCTAGGG TCACGGATGA TGATCGGCGT TTATATCGGT
CAGTACTGTA TCAACGCCCT GACTTACTTC TTTATTACCT GGTTCCCGGT TTATCTGGTG
CAGGCACGCG GGATGTCGAT TCTGAAAGCG GGCTTTGTGG CTTCCGTTCC GGCGGTTTGC
GGTTTTATCG GCGGTGTGCT GGGTGGGATT ATTTCCGACT GGCTGATGCG CCGCACGGGA
TCGCTGAACA TTGCGCGTAA AACACCGATC GTAATGGGCA TGTTGCTGTC GATGGTGATG
GTGTTCTGCA ACTACGTCAA CGTTGAGTGG ATGATCATCG GCTTTATGGC GCTGGCCTTC
TTCGGTAAGG GCATCGGGGC GCTGGGTTGG GCAGTAATGG CAGATACCGC GCCAAAAGAG
ATCAGCGGTC TTTCCGGTGG CCTGTTCAAC ATGTTCGGTA ACATTTCTGG CATCGTCACG
CCAATCGCAA TTGGTTATAT CGTTGGCACG ACTGGCTCGT TTAATGGGGC GCTGATTTAT
GTTGGTGTTC ATGCCTTAAT CGCGGTACTG AGCTACCTGG TGCTGGTGGG CGATATCAAG
CGTATCGAGT TGAAACCTGT TGCGGGGCAA TAA
 
Protein sequence
MSSLSQAASS VEKRTNARYW IVVMLFIVTS FNYGDRATLS IAGSEMAKDI GLDPVGMGYV 
FSAFSWAYVI GQIPGGWLLD RFGSKRVYFW SIFIWSMFTL LQGFVDIFSG FGIIVALFTL
RFLVGLAEAP SFPGNSRIVA AWFPAQERGT AVSIFNSAQY FATVIFAPIM GWLTHEVGWS
HVFFFMGGLG IVISFIWLKV IHEPNQHPGV NKKELEYIAA GGALINMDQQ NTKVKVPFSV
KWGQIKQLLG SRMMIGVYIG QYCINALTYF FITWFPVYLV QARGMSILKA GFVASVPAVC
GFIGGVLGGI ISDWLMRRTG SLNIARKTPI VMGMLLSMVM VFCNYVNVEW MIIGFMALAF
FGKGIGALGW AVMADTAPKE ISGLSGGLFN MFGNISGIVT PIAIGYIVGT TGSFNGALIY
VGVHALIAVL SYLVLVGDIK RIELKPVAGQ
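
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before they can be used. The sketch below is one possible way to do that in plain Python, then compute GC content and translate the gene sequence. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is built from the standard amino-acid string in TCAG order; to my knowledge translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons, so that is treated as an assumption here.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid string (assumed to apply to translation table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGAGTTCTT TAAGTCAGGC TGCGAGCAGT GTGGAAAAAC GCACAAATGC TCGTTACTGG"
print(translate(first_line))  # MSSLSQAASSVEKRTNARYW
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the stated GC content and protein sequence to be checked in the same way.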