Gene EcSMS35_3015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3015 
Symbol 
ID6142729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3103152 
End bp3104552 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content49% 
IMG OID641617884 
Productxanthine/uracil permease family protein 
Protein accessionYP_001745035 
Protein GI170681803 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2233] Xanthine/uracil permeases 
TIGRFAM ID[TIGR00801] uracil-xanthine permease
[TIGR03173] xanthine permease 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATA TAAACCATGC AGGTTCTGAC CTTATATTTG AACTGGAGGA TCGCCCTCCC 
TTTCATCAGG CTCTCGTTGG TGCCATTACC CATCTGTTGG CAATTTTCGT TCCGATGGTA
ACCCCCGCGT TAATCGTGGG TGCGGCCTTA CAGCTTTCCG CTGAAACAAC TGCCTATCTT
GTTTCAATGG CGATGATCGC CTCTGGTATT GGTACCTGGT TACAAGTAAA CCGCTATGGC
ATCGTCGGTT CTGGCCTACT CTCAATTCAG TCAGTCAATT TTTCATTTGT TACGGTCATG
ATTGCGCTGG GCAGCAGCAT GAAAAGCGAC GGTTTTCACG AAGAGTTAAT CATGTCGTCG
CTCCTCGGCG TCTCCTTCGT TGGCGCATTT CTGGTTGTCG GCTCTTCTTT TATCCTGCCC
TATTTACGTC GGGTTATTAC GCCTACCGTC AGCGGCATTG TGGTGCTGAT GATCGGCTTA
AGCCTGATTA AAGTCGGCAT TATCGATTTT GGTGGAGGAT TTGCAGCCAA AAGCAGCGGT
ACATTCGGCA ATTACGAACA TCTCGGCGTT GGTTTATTGG TTTTGATTGT GGTGATCGGC
TTTAACTGCT GTAGCAGTCC GTTGCTACGC ATGGGAGGGA TCGCCATTGG GCTATGTGTC
GGCTATATCG CATCGTTATG CCTGGGCATG GTGGATTTCA GCAGTATGCG CAATTTGCCG
TTAATCACCA TCCCACATCC GTTCAAATAC GGCTTTAGTT TTAGCTTCCA TCAGTTCCTG
GTGGTTGGCA CGATTTATCT GCTTAGCGTG CTGGAAGCTG TCGGCGATAT CACCGCCACG
GCAATGGTTT CCCGCCGCCC CATTCAGGGG GAAGAGTATC AGTCCCGACT GAAAGGCGGC
GTGCTGGCAG ACGGTCTGGT TTCTGTTATC GCCTCCGCTG TCGGATCATT ACCCTTAACC
ACGTTTGCGC AAAATAATGG GGTTATTCAG ATGACTGGCG TCGCTTCACG TTATGTCGGG
CGAACCATCG CGGTAATGCT GGTTATCCTC GGCTTATTTC CGATGATTGG CGGCTTCTTC
ACGACCATTC CCTCGGCAGT TCTGGGAGGC GCAATGACGT TGATGTTTTC CATGATTGCC
ATCGCAGGGA TTCGCATCAT CATCACCAAC GGTTTAAAGC GCCGAGAAAC ACTTATTGTC
GCCACTTCTT TAGGTTTAGG GCTTGGCGTC TCCTACGATC CCGAAATTTT TAAAATATTG
CCAGCCTCTA TTTATGTACT AGTTGAAAAC CCTATTTGTG CTGGCGGGTT AACTGCGATT
TTATTAAATA TTATCCTCCC TGGTGGCTAC CGACAGGAAA ACGTTCTGCC TGGTATTACC
TCAGCGGAAG AGATGGATTA A
 
Protein sequence
MSDINHAGSD LIFELEDRPP FHQALVGAIT HLLAIFVPMV TPALIVGAAL QLSAETTAYL 
VSMAMIASGI GTWLQVNRYG IVGSGLLSIQ SVNFSFVTVM IALGSSMKSD GFHEELIMSS
LLGVSFVGAF LVVGSSFILP YLRRVITPTV SGIVVLMIGL SLIKVGIIDF GGGFAAKSSG
TFGNYEHLGV GLLVLIVVIG FNCCSSPLLR MGGIAIGLCV GYIASLCLGM VDFSSMRNLP
LITIPHPFKY GFSFSFHQFL VVGTIYLLSV LEAVGDITAT AMVSRRPIQG EEYQSRLKGG
VLADGLVSVI ASAVGSLPLT TFAQNNGVIQ MTGVASRYVG RTIAVMLVIL GLFPMIGGFF
TTIPSAVLGG AMTLMFSMIA IAGIRIIITN GLKRRETLIV ATSLGLGLGV SYDPEIFKIL
PASIYVLVEN PICAGGLTAI LLNIILPGGY RQENVLPGIT SAEEMD