Gene EcSMS35_3372 details


Gene Information

Locus tag: EcSMS35_3372
Symbol:
ID: 6143075
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3452225
End bp: 3453658
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 50%
IMG OID: 641618201
Product: amino acid permease family protein
Protein accession: YP_001745350
Protein GI: 170679908
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
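
The coordinate and length fields in this record are mutually consistent, and a short check makes the arithmetic explicit. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3452225
end_bp = 3453658

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1434 477
```

Under these assumptions the computed values agree with the Gene Length (1434 bp) and Protein Length (477 aa) fields.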


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGATA CCAAACGTAA TACAATCGGC AAATTCGGCT TGCTCTCGCT GACTTTTGCC 
GCCGTTTACA GCTTTAACAA CGTTATCAAT AATAATATTG AGCTTGGACT GGCCTCGGCA
CCGATGTTTT TCCTCGCGAC GATTTTTTAT TTTATTCCCT TCTGTCTGAT CATTGCAGAA
TTTGTTTCGT TAAATAAAAA CTCAGAAGCC GGTGTCTACG CGTGGGTAAA AAGTTCGCTG
GGCGGACGTT GGGCATTTAT TACTGCCTAT ACCTACTGGT TCGTAAACCT GTTCTTTTTC
ACCTCGCTGT TACCGCGCGT TATTGCTTAT GCTTCGTATG CCTTCCTCGG CTACGAATAT
ATTATGACGC CGGTTGCCAC CACCATTATC AGTATGGTGC TGTTCGCCTT CTCCACATGG
GTTTCCACCA ACGGGGCGAA AATGCTGGGG CCAATCACCT CCGTCACTTC AACGCTGATG
CTGCTGTTAA CGCTCTCCTA CATTTTACTG GCAGGTACGG CACTGGTTGG CGGCGTACAG
CCTGCCGATC CGATCACCGT TGACGCGATG ATCCCGAACT TCAACTGGGC GTTCCTCGGC
GTGACCACCT GGATCTTTAT GGCCGCAGGT GGCGCGGAGT CCGTCGCGGT GTACGTTAAC
GACGTCAAAG GCGGTTCGAA ATCGTTCGTT AAAGTGATCA TCCTCGCCGG GATTTTTATC
GGCGTACTGT ATTCCGTCTC CTCGGTGCTG ATTAACGTCT TCGTCAGCAG CAAAGAGTTG
AAATTTACTG GCGGATCGGT ACAGGTATTC CACGGCATGG CGGCGTATTT TGGTCTACCG
GAAGCACTGA TGAATCGCTT TGTCGGTCTG GTGTCCTTTA CCGCGATGTT CGGTTCCCTG
CTGATGTGGA CGGCAACGCC GGTGAAAATT TTCTTCTCCG AAATTCCGGA AGGCATCTTT
GGTAAGAAAA CCGTCGAACT GAACGAAAAC GGCGTTCCGG CGCGCGCAGC GTGGATCCAG
TTCCTGATCG TCATCCCGCT GATGATTATC CCGATGCTCG GTTCCAATAC CGTGCAGGAT
CTGATGAATA CCATTATTAA TATGACCGCC GCAGCGTCCA TGCTTCCGCC GTTATTCATC
ATGCTGGCTT ACCTGAATTT ACGCGCCAAA TTAGATCACC TGCCACGCGA TTTCCGTATG
GGTTCCCGAC GCACCGGTAT TATCGTTGTT TCAATGCTGA TTGCGATATT TGCTGTAGGG
TTTGTCGCTT CGACATTCCC GACTGGCGCG AATATTCTGA CCATCATTTT TTATAACGTC
GGCGGTATTG TTATCTTCCT CCGCTTTGCG TGGTGGAAAT ACAGTAAATA TATAAAGGGA
TTAACGGCTG AAGAGCGCCA TATTGAAGCG ACGCCAGCCA GCAATGTTGA TTAA
 
Protein sequence
MSDTKRNTIG KFGLLSLTFA AVYSFNNVIN NNIELGLASA PMFFLATIFY FIPFCLIIAE 
FVSLNKNSEA GVYAWVKSSL GGRWAFITAY TYWFVNLFFF TSLLPRVIAY ASYAFLGYEY
IMTPVATTII SMVLFAFSTW VSTNGAKMLG PITSVTSTLM LLLTLSYILL AGTALVGGVQ
PADPITVDAM IPNFNWAFLG VTTWIFMAAG GAESVAVYVN DVKGGSKSFV KVIILAGIFI
GVLYSVSSVL INVFVSSKEL KFTGGSVQVF HGMAAYFGLP EALMNRFVGL VSFTAMFGSL
LMWTATPVKI FFSEIPEGIF GKKTVELNEN GVPARAAWIQ FLIVIPLMII PMLGSNTVQD
LMNTIINMTA AASMLPPLFI MLAYLNLRAK LDHLPRDFRM GSRRTGIIVV SMLIAIFAVG
FVASTFPTGA NILTIIFYNV GGIVIFLRFA WWKYSKYIKG LTAEERHIEA TPASNVD
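
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last lines of the gene. The following is a minimal Python sketch, not the method used to produce this record. It assumes that translation table 11 assigns the same amino acids as the standard genetic code (differing only in permitted start codons) and that the last gene line begins in frame. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard amino acid assignments listed in NCBI's TCAG codon order.
# Assumption: translation table 11 uses these same assignments and differs
# from the standard table only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence.
print(translate("ATGTCTGATA CCAAACGTAA TACAATCGGC"))  # MSDTKRNTIG

# Last line of the gene sequence, ending in the TAA stop codon.
last_line = "TTAACGGCTG AAGAGCGCCA TATTGAAGCG ACGCCAGCCA GCAATGTTGA TTAA"
print(translate(last_line))  # LTAEERHIEATPASNVD
```

Both outputs match the corresponding ends of the protein sequence listed above (`MSDTKRNTIG` at the start, `LTAEERHIEA TPASNVD` at the end).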