Gene EcDH1_3722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3722 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4008530 
End bp4009747 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content36% 
IMG OID 
ProductGeneral substrate transporter 
Protein accessionACX41328 
Protein GI260450906 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACAG CATGGTATAA ACAAGTTAAT CCACCACAAC GGAAAGCTCT TTTTTCCGCA 
TGGCTTGGAT ATGTATTTGA TGGCTTTGAT TTTATGATGA TATTTTACAT TCTTCATATT
ATAAAAGCAG ATCTTGGCAT TACGGATATT CAGGCTACTT TAATAGGGAC AGTGGCCTTC
ATAGCCAGAC CTATTGGAGG TGGTTTTTTT GGTGCCATGG CTGATAAATA TGGTCGTAAG
CCAATGATGA TGTGGGCAAT TTTCATTTAC TCAGTCGGAA CAGGCCTTAG CGGTATTGCT
ACAAACTTAT ATATGCTCGC AGTTTGCCGT TTTATTGTTG GCTTAGGGAT GTCTGGTGAA
TATGCATGTG CTTCAACTTA TGCGGTAGAA AGTTGGCCTA AAAATCTTCA ATCTAAAGCT
AGTGCTTTTT TGGTAAGTGG TTTTTCTGTT GGAAATATTA TTGCGGCACA AATAATCCCT
CAGTTTGCTG AAGTATATGG ATGGAGAAAC TCTTTTTTTA TAGGCCTGTT ACCAGTTTTA
CTAGTTCTTT GGATCAGAAA AAGTGCTCCA GAAAGTCAGG AGTGGATTGA AGATAAATAT
AAGGATAAAT CAACATTTTT GTCTGTCTTC AGAAAACCAC ATCTTTCAAT CTCTATGATC
GTTTTCCTCG TCTGTTTTTG TCTATTTGGT GCAAACTGGC CGATAAACGG ACTACTTCCT
TCCTACCTGG CAGATAATGG AGTTAATACA GTGGTCATTT CAACTCTGAT GACAATAGCA
GGTTTAGGAA CACTGACAGG TACAATATTT TTTGGTTTTG TTGGTGATAA GATTGGTGTA
AAAAAAGCCT TTGTAGTCGG TCTAATAACT TCATTTATTT TCCTTTGTCC TCTTTTTTTT
ATTTCTGTGA AAAACTCTTC TCTTATAGGA TTATGTCTCT TTGGATTAAT GTTTACAAAT
TTAGGTATTG CAGGGTTGGT TCCAAAATTT ATATATGATT ACTTTCCAAC AAAATTAAGA
GGATTAGGGA CCGGTCTTAT TTATAACTTA GGGGCAACTG GAGGAATGGC CGCACCTGTA
TTAGCTACAT ACATTTCAGG ATATTATGGC TTAGGTGTTT CATTATTCAT TGTTACGGTT
GCATTCTCTG CCTTATTAAT TTTGTTAGTT GGTTTTGATA TTCCAGGTAA AATTTATAAA
CTATCCGTGG CTAAATGA
 
Protein sequence
MATAWYKQVN PPQRKALFSA WLGYVFDGFD FMMIFYILHI IKADLGITDI QATLIGTVAF 
IARPIGGGFF GAMADKYGRK PMMMWAIFIY SVGTGLSGIA TNLYMLAVCR FIVGLGMSGE
YACASTYAVE SWPKNLQSKA SAFLVSGFSV GNIIAAQIIP QFAEVYGWRN SFFIGLLPVL
LVLWIRKSAP ESQEWIEDKY KDKSTFLSVF RKPHLSISMI VFLVCFCLFG ANWPINGLLP
SYLADNGVNT VVISTLMTIA GLGTLTGTIF FGFVGDKIGV KKAFVVGLIT SFIFLCPLFF
ISVKNSSLIG LCLFGLMFTN LGIAGLVPKF IYDYFPTKLR GLGTGLIYNL GATGGMAAPV
LATYISGYYG LGVSLFIVTV AFSALLILLV GFDIPGKIYK LSVAK