Gene EcSMS35_4627 details

Gene Information

Locus tag: EcSMS35_4627
Symbol:
ID: 6143335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4728446
End bp: 4729990
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 47%
IMG OID: 641619443
Product: amino acid permease family protein
Protein accession: YP_001746554
Protein GI: 170680322
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
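
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based inclusive positions and the CDS ends in a single stop codon. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the arithmetic, with illustrative function names that are not part of the database:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above.
length = gene_length_bp(4728446, 4729990)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed 1545 bp and 514 aa.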


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.592733
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.0531629
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCGCGT TTTATACACG CGCTGAAATG AAGGATGGTT TCATGCCTCA CACGATAAAA 
AAGATGAGTC TGATAGGACT CATATTGATG ATCTTTACTT CCGTATTTGG ATTTGCCAAT
AGCCCATCGG CTTATTACTT AATGGGTTAT AGTGCGATTC CCTTTTATAT ATTTTCTGCA
TTGTTATTCT TTATTCCATT CGCCTTAATG ATGGCTGAAA TGGGAGCGGC TTATCGCAAA
GAAGAAGGTG GTATCTATTC CTGGATGAAT AATAGTGTCG GACCACGTTT TGCCTTCATT
GGTACATTTA TGTGGTTTTC CTCTTATATC ATCTGGATGG TGAGTACCTC AGCGAAAGTT
TGGGTACCGT TCTCAACATT CCTCTATGGT AGCGACATGA CCCAGCACTG GCGTATTGCC
GGACTGGAGC CTACGCAGGT GGTTGGTCTG CTGGCAGTGG CATGGATGAT TCTGGTCACC
GTCGTTGCTT CAAAGGGGAT TAATAAAATT GCCCGCATTA CTGCGGTGGG CGGTATTGCA
GTAATGTGTC TGAATTTAGT ATTGCTGTTA GTAAGCATTA CTATTTTGTT ATTAAATGGT
GGGCATTTCG CGCAGGATAT TAATTTCCTT GCATCACCGA ACCCAGGTTA TCAGTCCGGT
CTGGCAATGC TATCGTTTGT GGTATTTGCT ATTTTTGCCT ATGGCGGAAT TGAAGCGGTG
GGTGGTCTGG TCGATAAAAC GGAAAATCCA GAAAAGAACT TTGCCAAAGG TATTGTTTTT
GCCGCTATTG TTATTTCAAT CGGTTATTCG CTGGCAATAT TTTTATGGGG CGTCAGCACA
AACTGGCAGC AGGTATTAAG TAATGGTTCC GTTAACCTCG GCAATATTAC CTATGTGCTG
ATGAAGAGCC TCGGGGTGAC GCTGGGTAAC GCACTGCATT TTTCACCTGA AGCGTCATTG
TCGCTGGGCG TATGGTTTGC GCGTATTACC GGACTGTCGA TGTTCCTCGC TTATACCGGT
GCGTTCTTTA CGCTTTGCTA TTCACCGCTG AAAGCCATCA TCCAGGGAAC GCCGAAAGCA
TTGTGGCCGG AACCGATGAC GCGCCTGAAT ACGATGGGGA TGCCGTCTAT TGCCATGTGG
ATGCAGTGCG GGTTGGTTAC TATCTTCATT CTGCTGGTTT CGTTTGGTGG CGGTACCGCA
TCGGCGTTCT TTAACAAGCT GACGCTGATG GCGAACGTGT CTATGACGCT TCCTTACCTG
TTCCTCGCGC TGGCTTTCCC ATTTTTTAAA GCACGTCAGG ATCTCGACAG ACCGTTTGTG
ATTTTCAAAA CGCATATGTC GGCAATGATT GCAACAGTGG TTGTCGTACT GGTGGTGACA
TTTGCGAACG TCTTCACCAT CATTCAACCT GTGGTTGAAG CTGGAGACTG GGACAGCACA
TTGTGGATGA TTGGCGGCCC TGTCTTCTTC TCGCTGTTAG CGATGGCGAT TTACCAGAAC
TATTGCAGTC GCATGGCGAA CAAACCTGAG TTAGCTCTCG ACTGA
 
Protein sequence
MGAFYTRAEM KDGFMPHTIK KMSLIGLILM IFTSVFGFAN SPSAYYLMGY SAIPFYIFSA 
LLFFIPFALM MAEMGAAYRK EEGGIYSWMN NSVGPRFAFI GTFMWFSSYI IWMVSTSAKV
WVPFSTFLYG SDMTQHWRIA GLEPTQVVGL LAVAWMILVT VVASKGINKI ARITAVGGIA
VMCLNLVLLL VSITILLLNG GHFAQDINFL ASPNPGYQSG LAMLSFVVFA IFAYGGIEAV
GGLVDKTENP EKNFAKGIVF AAIVISIGYS LAIFLWGVST NWQQVLSNGS VNLGNITYVL
MKSLGVTLGN ALHFSPEASL SLGVWFARIT GLSMFLAYTG AFFTLCYSPL KAIIQGTPKA
LWPEPMTRLN TMGMPSIAMW MQCGLVTIFI LLVSFGGGTA SAFFNKLTLM ANVSMTLPYL
FLALAFPFFK ARQDLDRPFV IFKTHMSAMI ATVVVVLVVT FANVFTIIQP VVEAGDWDST
LWMIGGPVFF SLLAMAIYQN YCSRMANKPE LALD
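
The protein sequence can be related to the gene sequence through translation table 11 (the bacterial, archaeal and plant plastid code). The gene begins with GTG, which table 11 allows as an initiation codon, and an initiating codon is read as methionine, which would explain why the protein starts with M rather than V. The sketch below is a hand-rolled illustration and not the tool that produced this record. The function and variable names are hypothetical, and the start-codon set is the one usually listed for table 11 (an assumption worth checking against the NCBI genetic code tables).

```python
from itertools import product

# Codon-to-residue map in the conventional TCAG ordering.
# Table 11 uses the same residue assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons commonly listed for table 11 (assumption).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    The first codon is read as M when it is an allowed start codon.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("GTGGGCGCGT TTTATACACG CGCTGAAATG"))
```

Applied to the first 30 bases, the sketch yields MGAFYTRAEM, matching the first block of the protein sequence. The spaced ten-base blocks can be pasted in directly because whitespace is stripped before translation.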