Gene EcSMS35_0553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0553 
Symbol 
ID6146612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp560822 
End bp562276 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content48% 
IMG OID641615447 
Productallantoin permease 
Protein accessionYP_001742654 
Protein GI170680776 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.962113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACATC AGAGAAAACT ATTCCAGCAA CGCGGCTATA GCGAAGATCT ATTGCCGAAA 
ACGCAAAGCC AGCGGACCTG GAAAACATTT AACTATTTTA CCTTATGGAT GGGTTCGGTT
CATAACGTTC CCAATTATGT GATGGTCGGC GGCTTTTTTA TTCTCGGCTT GTCTACCTTT
AGTATTATGC TGGCAATTAT CCTCAGCGCC TTTTTCATTG CCGCGGTAAT GGTATTAAAC
GGTGCTGCGG GCAGTAAATA CGGTGTGCCT TTTGCCATGA TCCTGCGTGC TTCTTACGGC
GTACGTGGTG CACTGTTTCC CGGATTATTA AGAGGCGGGA TTGCTGCCAT TATGTGGTTT
GGCCTGCAAT GTTACGCTGG ATCGCTGGCC TGCTTGATCC TGATTGGCAA AATCTGGCCG
GGATTTTTAA CTCTCGGTGG TGATTTTACC CTGTTAGGGC TTTCTCTACC GGGTTTAATC
ACTTTCTTAC TCTTCTGGCT GGTGAACGTC GGAATCGGTT TCGGTGATGG CAAAGTTTTA
AATAAATTCA CTGCCATTCT TAACCCGTGC ATCTATATCG TTTTCGGCGG TATGGCGATT
TGGGCGATTT CGCTGGTCGG GATCGGTCCA ATCTTTGACT ATATTCCGGG CGGTATTCAG
AAAGCAGAAA ACGGTGGCTT CCTGTTCCTG GTGGTGATTA ACGCGGTAGT TGCGGTCTGG
GCGGCACCGG CGGTGAGCGC ATCCGACTTT ACGCAAAACG CCCACTCGTT TCGTGAGCAG
GCGCTGGGGC AAACGCTGGG TTTAGTTGTG GCCTATATTC TGTTTGCGGT CGCCGGGGTA
TGTATTATTG CCGGAGCCAG TATTCACTAC GGCGCGGACA CCTGGAACGT GCTGGATATT
GTTCAGCGTT GGGACAGCCT GTTTGCCTCG TTCTTTGCGG TACTGGTTAT TCTGATGACG
ACTATCTCCA CTAACGCCAC CGGTAATATT ATTCCGGCTG GTTATCAGAT AGCCGCTATT
GCACCGACGA AACTGACCTA TAAAAACGGC GTACTGATTG CCAGTATTAT CAGCTTGCTG
ATCTGCCCGT GGAAATTAAT GGAAAATCAG GACAGCATTT ATCTGTTCCT CGATATTATC
GGCGGAATGC TGGGTCCGGT AATTGGTGTC ATGATGGCGC ATTATTTTGT GGTGATGCGC
GGACAAATTA ATCTTGATGA ACTGTATACC GCAGCAGGGG ATTTCAAATA TTACGATAAC
GGCTTTAACC TTACCGCGTT CTCAGTAACT TTGGTCGCCG TTATTTTATC TCTCGGTGGT
AAATTTATTC CGTTCATGGA GCCTTTATCA CGCGTTTCAT GGTTTGTCGG CGTTATTGTC
GCCTTTGCGG CCTACGCCTT ATTAAAGAAG CGTACTGCAG CAGAAAAAAC AGGAGAACAA
AAAGTCACAG GTTAA
 
Protein sequence
MEHQRKLFQQ RGYSEDLLPK TQSQRTWKTF NYFTLWMGSV HNVPNYVMVG GFFILGLSTF 
SIMLAIILSA FFIAAVMVLN GAAGSKYGVP FAMILRASYG VRGALFPGLL RGGIAAIMWF
GLQCYAGSLA CLILIGKIWP GFLTLGGDFT LLGLSLPGLI TFLLFWLVNV GIGFGDGKVL
NKFTAILNPC IYIVFGGMAI WAISLVGIGP IFDYIPGGIQ KAENGGFLFL VVINAVVAVW
AAPAVSASDF TQNAHSFREQ ALGQTLGLVV AYILFAVAGV CIIAGASIHY GADTWNVLDI
VQRWDSLFAS FFAVLVILMT TISTNATGNI IPAGYQIAAI APTKLTYKNG VLIASIISLL
ICPWKLMENQ DSIYLFLDII GGMLGPVIGV MMAHYFVVMR GQINLDELYT AAGDFKYYDN
GFNLTAFSVT LVAVILSLGG KFIPFMEPLS RVSWFVGVIV AFAAYALLKK RTAAEKTGEQ
KVTG