Gene EcDH1_3800 details

Gene Information

Locus tag: EcDH1_3800
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 4093204
End bp: 4094601
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: putative sugar-specific permease SgaT/UlaA
Protein accession: ACX41402
Protein GI: 260450980
COG category:
COG ID:
TIGRFAM ID:
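
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(4093204, 4094601)
print(length)                     # 1398
print(protein_length_aa(length))  # 465
```

Under these assumptions the computed values agree with the Gene Length (1398 bp) and Protein Length (465 aa) fields of the record.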


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGATCC TCTACAACAT CTTTACCGTG TTTTTTAACC AGGTCATGAC CAATGCCCCG 
TTGTTGCTGG GTATTGTGAC CTGTCTGGGC TACATCCTAC TGCGCAAAAG TGTCAGCGTT
ATTATTAAAG GCACGATTAA AACCATAATT GGTTTCATGT TGTTGCAGGC AGGGTCCGGC
ATCCTCACCA GCACCTTCAA ACCGGTGGTG GCGAAAATGT CCGAAGTCTA CGGCATTAAC
GGCGCAATTT CCGATACCTA CGCTTCAATG ATGGCAACCA TCGACCGCAT GGGCGATGCC
TATAGCTGGG TGGGTTACGC CGTATTGTTA GCGCTGGCGC TGAACATCTG TTACGTGCTG
TTGCGTCGCA TTACCGGCAT TCGCACAATC ATGTTGACCG GCCACATCAT GTTCCAGCAG
GCCGGGTTGA TTGCCGTTAC GCTGTTTATC TTCGGCTACT CCATGTGGAC CACCATTATC
TGTACCGCGA TTCTGGTTTC GCTCTACTGG GGCATCACTT CCAACATGAT GTACAAGCCG
ACTCAGGAAG TGACGGATGG CTGTGGTTTC TCCATCGGTC ACCAGCAGCA GTTTGCATCA
TGGATTGCCT ATAAAGTCGC GCCGTTCCTC GGCAAAAAAG AGGAGAGCGT TGAAGACCTC
AAATTGCCGG GCTGGCTGAA CATTTTCCAC GACAACATCG TCTCCACGGC GATTGTGATG
ACCATCTTCT TTGGTGCCAT TCTGCTCTCC TTCGGTATCG ACACCGTGCA GGCGATGGCA
GGCAAAGTGC ACTGGACGGT GTACATCCTG CAAACTGGTT TCTCCTTTGC GGTGGCGATC
TTCATCATCA CGCAGGGTGT GCGCATGTTT GTGGCGGAAC TCTCTGAAGC ATTTAACGGC
ATTTCCCAGC GCCTGATCCC AGGTGCGGTT CTGGCGATTG ACTGTGCAGC TATCTATAGC
TTCGCGCCGA ACGCCGTGGT CTGGGGCTTT ATGTGGGGCA CCATCGGTCA GCTGATTGCG
GTTGGCATCC TGGTCGCCTG CGGCTCCTCG ATCCTGATTA TTCCTGGCTT TATCCCGATG
TTCTTCTCTA ACGCCACCAT CGGCGTGTTC GCTAACCACT TCGGCGGCTG GCGTGCGGCG
CTGAAGATTT GTCTGGTGAT GGGGATGATC GAAATCTTTG GTTGCGTCTG GGCGGTGAAA
CTCACCGGTA TGAGTGCCTG GATGGGCATG GCGGACTGGT CGATTCTGGC ACCGCCGATG
ATGCAAGGCT TCTTCTCCAT CGGTATCGCC TTTATGGCCG TCATCATTGT AATTGCACTG
GCTTATATGT TCTTCGCTGG CCGCGCGCTG CGCGCAGAAG AAGATGCAGA AAAACAACTG
GCAGAACAGT CTGCTTAA
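
The GC content field of the record can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene sequence; the helper name gc_percent is hypothetical, and the database may compute or round the value differently.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First block of ten bases from the gene sequence above.
print(gc_percent("ATGGAGATCC"))  # 50.0
```

Passing the complete gene sequence, with its spaces and line breaks left in, gives a value that can be compared with the reported 53%.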
 
Protein sequence
MEILYNIFTV FFNQVMTNAP LLLGIVTCLG YILLRKSVSV IIKGTIKTII GFMLLQAGSG 
ILTSTFKPVV AKMSEVYGIN GAISDTYASM MATIDRMGDA YSWVGYAVLL ALALNICYVL
LRRITGIRTI MLTGHIMFQQ AGLIAVTLFI FGYSMWTTII CTAILVSLYW GITSNMMYKP
TQEVTDGCGF SIGHQQQFAS WIAYKVAPFL GKKEESVEDL KLPGWLNIFH DNIVSTAIVM
TIFFGAILLS FGIDTVQAMA GKVHWTVYIL QTGFSFAVAI FIITQGVRMF VAELSEAFNG
ISQRLIPGAV LAIDCAAIYS FAPNAVVWGF MWGTIGQLIA VGILVACGSS ILIIPGFIPM
FFSNATIGVF ANHFGGWRAA LKICLVMGMI EIFGCVWAVK LTGMSAWMGM ADWSILAPPM
MQGFFSIGIA FMAVIIVIAL AYMFFAGRAL RAEEDAEKQL AEQSA
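
The protein sequence above corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, assuming the codon-to-amino-acid assignments of table 11 match the standard genetic code (table 11 differs in which codons may act as start codons, which is not relevant here because the gene begins with ATG). The names CODON_TABLE and translate are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop spaces and line breaks
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last blocks of the gene sequence above.
print(translate("ATGGAGATCC TCTACAACAT CTTTACCGTG"))  # MEILYNIFTV
print(translate("GCAGAACAGT CTGCTTAA"))               # AEQSA
```

The two printed fragments match the beginning and end of the protein sequence listed in the record.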