Gene EcHS_A1771 details


Gene Information

Locus tag: EcHS_A1771
Symbol: ydiM
ID: 5591531
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1793128
End bp: 1794309
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 45%
IMG OID: 640920921
Product: major facilitator family transporter
Protein accession: YP_001458473
Protein GI: 157161155
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
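
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1793128
end_bp = 1794309
gene_length_bp = 1182
protein_length_aa = 393


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both checks print True: 1794309 - 1793128 + 1 = 1182 bp, and 1182 / 3 - 1 = 393 aa.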


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTATTTTA ATTACCTGGT GCATGGTATG GGCGTCATTT TGATGAGCCT GAATATGGCC 
TCGCTGGAGA CACTTTGGCA GACTAATGCC GCGGGTGTCT CGATAGTTAT CTCATCGCTG
GGCATTGGTC GATTAAGTGT CTTGCTTTTT GCAGGATTAT TATCCGATCG CTTTGGTCGC
CGCCCTTTTA TCATGCTCGG GATGTGCTGC TATATGGCCT TCTTTTTTGG CATCCTGCAG
ACCAATAACA TCATTATCGC TTATGTTTTT GGCTTTCTGG CGGGAATGGC AAACAGTTTT
CTCGATGCAG GCACTTATCC CAGTTTGATG GAAGCTTTTC CACGCTCACC TGGGACAGCC
AATATTTTAA TTAAAGCATT TGTTTCCAGC GGACAATTTT TATTACCGCT AATCATTAGC
CTGTTAGTGT GGGCTGAACT GTGGTTCGGT TGGTCCTTTA TGATTGCTGC AGGCATTATG
TTTATTAACG CTCTGTTTTT ATACCGTTGT ACGTTCCCAC CCCATCCGGG TCGTCGCTTA
CCTGTCATAA AGAAAACCAC CAGCTCTACG GAACATCGCT GTTCAATTAT CGATTTAGCC
AGTTATACCT TATATGGCTA TATCTCAATG GCAACGTTTT ATCTGGTTAG CCAGTGGCTG
GCACAGTACG GACAATTTGT TGCAGGCATG TCATACACTA TGTCGATCAA ACTACTCAGT
ATCTACACCG TGGGTTCGCT GCTTTGTGTA TTTATTACCG CTCCACTCAT TCGTAATACC
GTTCGCCCAA CAACATTACT GATGCTGTAC ACCTTTATCT CATTTATTGC TCTGTTTACC
GTCTGCCTGC ATCCCACATT TTATGTGGTG ATAATATTTG CTTTTGTCAT TGGTTTTACC
TCTGCTGGAG GTGTTGTGCA AATTGGCCTG ACGTTAATGG CTGAACGTTT CCCTTACGCT
AAAGGTAAAG CTACAGGGAT CTATTACAGT GCGGGCAGTA TTGCGACCTT TACTATTCCG
TTGATTACGG CTCATCTGTC CCAAAGAAGT ATTGCCGATA TTATGTGGTT CGATACCGCC
ATCGCTGCCA TCGGTTTTTT ACTGGCACTG TTTATCGGCT TACGCAGCCG CAAAAAAACG
CGGCATCACT CGCTAAAGGA AAATGTCGCT CCGGGTGGGT AA
 
Protein sequence
MYFNYLVHGM GVILMSLNMA SLETLWQTNA AGVSIVISSL GIGRLSVLLF AGLLSDRFGR 
RPFIMLGMCC YMAFFFGILQ TNNIIIAYVF GFLAGMANSF LDAGTYPSLM EAFPRSPGTA
NILIKAFVSS GQFLLPLIIS LLVWAELWFG WSFMIAAGIM FINALFLYRC TFPPHPGRRL
PVIKKTTSST EHRCSIIDLA SYTLYGYISM ATFYLVSQWL AQYGQFVAGM SYTMSIKLLS
IYTVGSLLCV FITAPLIRNT VRPTTLLMLY TFISFIALFT VCLHPTFYVV IIFAFVIGFT
SAGGVVQIGL TLMAERFPYA KGKATGIYYS AGSIATFTIP LITAHLSQRS IADIMWFDTA
IAAIGFLLAL FIGLRSRKKT RHHSLKENVA PGG
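
The relationship between the two sequences above can be illustrated with a short translation sketch. It is an illustration under stated assumptions, not the tool that produced this record: the codon table is written out by hand in NCBI order, the start-codon set is the one listed for translation table 11 (bacterial), and an initiating codon such as TTG is read as M. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq_blocks: str) -> str:
    """Join the space-separated 10-letter blocks into one uppercase string."""
    return "".join(seq_blocks.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGTATTTTA ATTACCTGGT GCATGGTATG"))  # MYFNYLVHGM
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence is the natural next check. `gc_fraction` can be applied to the full gene sequence in the same way to compare against the GC content field.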