Gene EcDH1_3965 details


Gene Information

Locus tag: EcDH1_3965
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 4272216
End bp: 4273691
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: sugar transporter
Protein accession: ACX41565
Protein GI: 260451143
COG category:
COG ID:
TIGRFAM ID:
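
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4272216
end_bp = 4273691
reported_gene_length = 1476     # bp
reported_protein_length = 491   # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print("gene length matches:", gene_length == reported_gene_length)
print("length is a whole number of codons:", gene_length % 3 == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

If any of the three checks printed False, the most likely cause would be a different coordinate convention (for example 0-based starts) rather than an error in the record.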


Plasmid Coverage information

Num covering plasmid clones: 58
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACCC AGTATAATTC CAGTTATATA TTTTCGATTA CCTTAGTCGC TACATTAGGT 
GGTTTATTAT TTGGCTACGA CACCGCCGTT ATTTCCGGTA CTGTTGAGTC ACTCAATACC
GTCTTTGTTG CTCCACAAAA CTTAAGTGAA TCCGCTGCCA ACTCCCTGTT AGGGTTTTGC
GTGGCCAGCG CTCTGATTGG TTGCATCATC GGCGGTGCCC TCGGTGGTTA TTGCAGTAAC
CGCTTCGGTC GTCGTGATTC ACTTAAGATT GCTGCTGTCC TGTTTTTTAT TTCTGGTGTA
GGTTCTGCCT GGCCAGAACT TGGTTTTACC TCTATAAACC CGGACAACAC TGTGCCTGTT
TATCTGGCAG GTTATGTCCC GGAATTTGTT ATTTATCGCA TTATTGGCGG TATTGGCGTT
GGTTTAGCCT CAATGCTCTC GCCAATGTAT ATTGCGGAAC TGGCTCCAGC TCATATTCGC
GGGAAACTGG TCTCTTTTAA CCAGTTTGCG ATTATTTTCG GGCAACTTTT AGTTTACTGC
GTAAACTATT TTATTGCCCG TTCCGGTGAT GCCAGCTGGC TGAATACTGA CGGCTGGCGT
TATATGTTTG CCTCGGAATG TATCCCTGCA CTGCTGTTCT TAATGCTGCT GTATACCGTG
CCAGAAAGTC CTCGCTGGCT GATGTCGCGC GGCAAGCAAG AACAGGCGGA AGGTATCCTG
CGCAAAATTA TGGGCAACAC GCTTGCAACT CAGGCAGTAC AGGAAATTAA ACACTCCCTG
GATCATGGCC GCAAAACCGG TGGTCGTCTG CTGATGTTTG GCGTGGGCGT GATTGTAATC
GGCGTAATGC TCTCCATCTT CCAGCAATTT GTCGGCATCA ATGTGGTGCT GTACTACGCG
CCGGAAGTGT TCAAAACGCT GGGGGCCAGC ACGGATATCG CGCTGTTGCA GACCATTATT
GTCGGAGTTA TCAACCTCAC CTTCACCGTT CTGGCAATTA TGACGGTGGA TAAATTTGGT
CGTAAGCCAC TGCAAATTAT CGGCGCACTC GGAATGGCAA TCGGTATGTT TAGCCTCGGT
ACCGCGTTTT ACACTCAGGC ACCGGGTATT GTGGCGCTAC TGTCGATGCT GTTCTATGTT
GCCGCCTTTG CCATGTCCTG GGGTCCGGTA TGCTGGGTAC TGCTGTCGGA AATCTTCCCG
AATGCTATTC GTGGTAAAGC GCTGGCAATC GCGGTGGCGG CCCAGTGGCT GGCGAACTAC
TTCGTCTCCT GGACCTTCCC GATGATGGAC AAAAACTCCT GGCTGGTGGC CCATTTCCAC
AACGGTTTCT CCTACTGGAT TTACGGTTGT ATGGGCGTTC TGGCAGCACT GTTTATGTGG
AAATTTGTCC CGGAAACCAA AGGTAAAACC CTTGAGGAGC TGGAAGCGCT CTGGGAACCG
GAAACGAAGA AAACACAACA AACTGCTACG CTGTAA
 
Protein sequence
MNTQYNSSYI FSITLVATLG GLLFGYDTAV ISGTVESLNT VFVAPQNLSE SAANSLLGFC 
VASALIGCII GGALGGYCSN RFGRRDSLKI AAVLFFISGV GSAWPELGFT SINPDNTVPV
YLAGYVPEFV IYRIIGGIGV GLASMLSPMY IAELAPAHIR GKLVSFNQFA IIFGQLLVYC
VNYFIARSGD ASWLNTDGWR YMFASECIPA LLFLMLLYTV PESPRWLMSR GKQEQAEGIL
RKIMGNTLAT QAVQEIKHSL DHGRKTGGRL LMFGVGVIVI GVMLSIFQQF VGINVVLYYA
PEVFKTLGAS TDIALLQTII VGVINLTFTV LAIMTVDKFG RKPLQIIGAL GMAIGMFSLG
TAFYTQAPGI VALLSMLFYV AAFAMSWGPV CWVLLSEIFP NAIRGKALAI AVAAQWLANY
FVSWTFPMMD KNSWLVAHFH NGFSYWIYGC MGVLAALFMW KFVPETKGKT LEELEALWEP
ETKKTQQTAT L
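
The gene and protein sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before any analysis. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It assumes the sequence is already given in coding orientation (it starts with ATG), and it uses the standard codon assignments, which translation table 11 shares with the standard code for elongation codons. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO[i]
    for i, codon in enumerate(product(BASES, repeat=3))
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_line = "ATGAATACCC AGTATAATTC CAGTTATATA"
print(translate(first_line))
```

Applied to the first thirty bases, `translate` returns MNTQYNSSYI, which matches the first block of the protein sequence. Pasting the full gene sequence into the same functions would allow the reported protein length and GC content to be checked in the same way.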