Gene EcHS_A3343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3343 
Symbol 
ID5591983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3348024 
End bp3349064 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content58% 
IMG OID640922461 
Productputative permease 
Protein accessionYP_001459954 
Protein GI157162636 
COG category[R] General function prediction only 
COG ID[COG0701] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones62 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGGTC AGTCTTCATC TCAGGCGGCA ACGCCCATTC AGTGGTGGAA ACCCGCGCTT 
TTCTTTCTCG TTGTCATTGC CGGTCTCTGG TATGTGAAAT GGGAACCTTA CTACGGCAAA
GCGTTTACTG CTGCCGAAAC CCACAGTATC GGTAAATCTA TCCTTGCGCA GGCGGATGCT
AACCCATGGC AGGCGGCGTT GGATTACGCG ATGATCTATT TCCTCGCGGT ATGGAAAGCG
GCGGTGCTGG GGGTGATCCT CGGTTCGTTG ATTCAGGTGC TGATCCCGCG TGACTGGTTG
TTGCGTACGC TTGGGCAATC GCGCTTTCGC GGCACGCTGC TGGGAACGCT GTTTTCGTTG
CCGGGCATGA TGTGTACCTG CTGTGCGGCT CCGGTCGCGG CGGGAATGCG TCGCCAACAG
GTGTCGATGG GCGGTGCGCT GGCATTCTGG ATGGGCAATC CGGTGTTAAA CCCGGCGACG
CTGGTGTTTA TGGGCTTTGT CCTCGGCTGG GGTTTTGCGG CGATTCGTCT GGTGGCCGGG
CTGGTGATGG TGTTGCTGAT TGCGACGCTG GTGCAAAAAT GGGTGCGTGA AACACCGCAA
ACGCAGGCAC CGGTCGAAAT TGACATACCG GAAGCACAGG GCGGGTTTTT TAGCCGCTGG
GGCAGGGCGC TATGGACGCT TTTCTGGAGT ACGATCCCGG TTTACATCCT TGCAGTACTG
GTGTTGGGTG CCGCTCGCGT CTGGTTATTC CCCCATGCCG ATGGTGCTGT CGATAACAGC
CTGATGTGGG TGGTGGCGAT GGCGGTAGCA GGATGCTTGT TTGTCATTCC CACGGCAGCA
GAAATTCCGA TTGTACAAAC GATGATGCTG GCAGGTATGG GAACCGCTCC GGCGCTGGCA
TTGTTGATGA CGCTCCCGGC GGTGAGTTTG CCGTCACTGA TTATGCTGCG CAAAGCGTTC
CCGGCGAAAG CCTTATGGCT GATGGGGGCG ATGGTGGCAG TGTCTGGTGT GATTGTCGGC
GGGCTGGCGC TGTTGTTCTG A
 
Protein sequence
MTGQSSSQAA TPIQWWKPAL FFLVVIAGLW YVKWEPYYGK AFTAAETHSI GKSILAQADA 
NPWQAALDYA MIYFLAVWKA AVLGVILGSL IQVLIPRDWL LRTLGQSRFR GTLLGTLFSL
PGMMCTCCAA PVAAGMRRQQ VSMGGALAFW MGNPVLNPAT LVFMGFVLGW GFAAIRLVAG
LVMVLLIATL VQKWVRETPQ TQAPVEIDIP EAQGGFFSRW GRALWTLFWS TIPVYILAVL
VLGAARVWLF PHADGAVDNS LMWVVAMAVA GCLFVIPTAA EIPIVQTMML AGMGTAPALA
LLMTLPAVSL PSLIMLRKAF PAKALWLMGA MVAVSGVIVG GLALLF