Gene EcSMS35_4197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4197 
Symbol 
ID6142696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4297574 
End bp4298797 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content51% 
IMG OID641619020 
Productcytosine/purines/uracil/thiamine/allantoin family permease 
Protein accessionYP_001746148 
Protein GI170680197 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1457] Purine-cytosine permease and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.00118639 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTAAAA AAGAAGAGAA TCTGAATACG GCATCAGGAT TGCGTATTGC CATGATTTTG 
CTGGGTATTG CCGTCACACC TGTGCTGTTG TCATCCTCAA GCCTCGGCAA TCAACTTTCC
AGCGGCAGTT TAATTAGCGT CGTATTGTTA GGCGGCGTCA TTCTGACCTT ACTTTCAGCC
ATCACCATTA GCGTGGGAGA AAAAGCCCGC CTACCAACGT ATGGCATTGT GAAATATTCG
TTTGGCGAAA AAGGGGCCAT CGCCATTAAC ATTTTGATGG CGATAAGTCT GTTCGGCTGG
ATTGCCGTTA CCGCCAATAT GTTTGGTCAT TCGGTACATG ACTTACTGGC TCAACATGGA
CTGGAAGTTC CACTGGCACT GTTAGTGGCG GCTGGCTGTG TCATTTTTGT CGCCTCTACG
GCATTTGGCT TTGCCGTTCT GGGAAAAATT GCCCAGGTTG CCGTGCCGGT TATCGCGCTG
GTGCTGTGTT ACATCCTCTA TGTGGCAACC CATACCGAAG TGGCAGTACC AGCGGCGATT
GTGGAGATGA ATACAGGTGT CGCCGTTTCC ACCGTTGTTG GCACCATTAT TGTGCTGGTT
GCCACACTGC CTGATTTCGG TAGTTTTGTG CATAACCGCA AACATGCGCT GATTGCCGCA
GGCGTGACGT TTCTGGTTGC CTACCCTCTG CTCTACTGGG CGGGTGCAAC GCCGAGCGCC
ATTAGTGGTC AGGGATCTTT ACTGGGTGCG ATGGCGGTAT TCGGTGCGGT TCTGCCTGCG
GCGCTGTTGT TGATTTTCGC CTGCGTCACC GGTAACGCGG GCAATATGTT CCAGGGCACG
CTGGTGGTTT CCACACTGCT TACCCGCTTT CCCAAATGGC AGATTACCGT GGCGCTGGGT
ATCCTTTCCG CCATCGTAGG CAGTATGGAT ATTATGGCGT GGTTTATTCC GTTTCTGCTG
TTCCTGGGTA TCGCCACGCC ACCCGTTGCC GGAATTTATA TCGCTGACTT TTTCTTTTAT
CGCCGTAATG GCTATCAAGA GTCAGTGTTA GCCCAGGAGT CACAGATTAA AGTGCTGACA
TTCGCAGCAT GGATCATAGG CGCAGCGGTT GGCTTTATGA CCGTAAAAGG CTTATTCACC
CTGACGACGA TCCCTTCGGT AGACTCGATT CTGGTGGCAT GTATCGCTTA TGCGATTCTC
AGTCGGGCAA GTCAACACCG CTAA
 
Protein sequence
MRKKEENLNT ASGLRIAMIL LGIAVTPVLL SSSSLGNQLS SGSLISVVLL GGVILTLLSA 
ITISVGEKAR LPTYGIVKYS FGEKGAIAIN ILMAISLFGW IAVTANMFGH SVHDLLAQHG
LEVPLALLVA AGCVIFVAST AFGFAVLGKI AQVAVPVIAL VLCYILYVAT HTEVAVPAAI
VEMNTGVAVS TVVGTIIVLV ATLPDFGSFV HNRKHALIAA GVTFLVAYPL LYWAGATPSA
ISGQGSLLGA MAVFGAVLPA ALLLIFACVT GNAGNMFQGT LVVSTLLTRF PKWQITVALG
ILSAIVGSMD IMAWFIPFLL FLGIATPPVA GIYIADFFFY RRNGYQESVL AQESQIKVLT
FAAWIIGAAV GFMTVKGLFT LTTIPSVDSI LVACIAYAIL SRASQHR