Gene EcSMS35_3989 details

Gene Information

Locus tag: EcSMS35_3989
Symbol: yicE
ID: 6146722
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4066834
End bp: 4068225
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 55%
IMG OID: 641618815
Product: xanthine permease
Protein accession: YP_001745954
Protein GI: 170682109
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2233] Xanthine/uracil permeases
TIGRFAM ID: [TIGR00801] uracil-xanthine permease; [TIGR03173] xanthine permease
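
The start, end, gene length, and protein length reported in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the record states neither convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4066834
END_BP = 4068225
REPORTED_GENE_LENGTH = 1392    # bp
REPORTED_PROTEIN_LENGTH = 463  # aa


def span_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_from_cds(cds_length):
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


gene_length = span_length(START_BP, END_BP)
protein_length = protein_length_from_cds(gene_length)

print(gene_length == REPORTED_GENE_LENGTH)        # True
print(protein_length == REPORTED_PROTEIN_LENGTH)  # True
```

Under these assumptions the reported coordinates, the 1392 bp gene length, and the 463 aa protein length are mutually consistent.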


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGTTT CCACCCTCGA GTCAGAAAAT GCGCAACCGG TTGCGCAGAC TCAAAACAGC 
GAACTGATTT ACCGTCTTGA AGATCGTCCG CCGCTTCCTC AAACCCTGTT TGCCGCATGT
CAGCATTTGC TGGCGATGTT CGTTGCGGTG ATCACGCCAG CGCTATTAAT CTGCCAGGCG
CTGGGTTTAC CGGCACAAGA CACGCAACAC ATTATTAGTA TGTCGCTGTT TGCCTCCGGT
GTGGCATCGA TTATTCAAAT TAAGGCCTGG GGTCCGGTTG GCTCCGGGTT GTTGTCTATT
CAGGGCACCA GCTTCAACTT TGTTGCTCCG CTGATTATGG GCGGCACAGC GCTGAAGACC
GGTGGTGCTG ATGTTCCTAC CATGATGGCG GCTTTGTTCG GCACGTTGAT GCTGGCAAGT
TGTACCGAGA TGGTGATCTC CCGCGTTCTG CATCTGGCGC GCCGCATTAT TACGCCGCTG
GTTTCTGGCG TTGTGGTGAT GATTATCGGC CTGTCGCTAA TTCAGGTGGG GCTGACCTCC
ATTGGCGGCG GTTACGCAGC CATGAGCGAT AACACCTTCG GCGCACCGAA AAATCTGCTG
CTGGCAGGCG TGGTCTTAGC CTTAATTATC CTGCTTAACC GTCAACGTAA CCCTTACTTA
CGCGTGGCCT CACTGGTGAT TGCGATGGCG GCCGGATATG CGCTGGCGTG GTTTATGGAC
ATGTTGCCAG AAAGCAACGA ACCGATGACG CAAGAACTGA TTATGGTGCC AACGCCGCTC
TATTACGGTC TTGGCATTGA ATGGAGTCTG CTGCTGCCGC TGATGCTGGT CTTTATGATC
ACTTCGCTGG AAACCATTGG CGATATCACG GCGACCTCTG ACGTTTCCGA ACAGCCGGTT
TCCGGTCCGC TGTACATGAA ACGCCTGAAA GGCGGCGTGC TGGCAAACGG CCTGAACTCG
TTTGTCTCGG CGGTATTTAA TACCTTCCCG AACTCCTGCT TCGGGCAGAA CAACGGGGTG
ATCCAGTTAA CCGGTGTTGC CAGCCGCTAT GTCGGTTTTG TCGTCGCGCT GATGTTGATC
GTGCTGGGTC TGTTCCCGGC AGTGAGCGGT TTTGTGCAAC ACATTCCAGA ACCGGTTCTG
GGCGGCGCAA CGCTTGTAAT GTTTGGCACC ATCGCTGCCT CCGGTGTGCG CATTGTGTCT
CGTGAGCCGC TGAACCGTCG GGCGATTCTG ATTATCGCGC TGTCGCTGGC GGTTGGTCTG
GGCGTGTCTC AGCAGCCGCT GATTTTGCAG TTTGCCCCTG AATGGCTGAA AAACCTGCTC
TCCTCCGGGA TCGCCGCGGG CGGTATTACT GCCATCGTGC TGAATCTGAT TTTCCCACCA
GAAAAACAGT AA
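
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before any length or composition check. The following is a minimal sketch of that clean-up and of a GC-content calculation; the helper names are hypothetical, and the example input is only the first line of the sequence, not the full gene.

```python
def clean_sequence(text):
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGTCTGTTT CCACCCTCGA GTCAGAAAAT "
    "GCGCAACCGG TTGCGCAGAC TCAAAACAGC"
)

seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this 60-base fragment only
```

Pasting the complete gene sequence into `clean_sequence` should give a 1392-base string if the record is consistent, and `gc_percent` on that string can be compared with the reported 55%.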
 
Protein sequence
MSVSTLESEN AQPVAQTQNS ELIYRLEDRP PLPQTLFAAC QHLLAMFVAV ITPALLICQA 
LGLPAQDTQH IISMSLFASG VASIIQIKAW GPVGSGLLSI QGTSFNFVAP LIMGGTALKT
GGADVPTMMA ALFGTLMLAS CTEMVISRVL HLARRIITPL VSGVVVMIIG LSLIQVGLTS
IGGGYAAMSD NTFGAPKNLL LAGVVLALII LLNRQRNPYL RVASLVIAMA AGYALAWFMD
MLPESNEPMT QELIMVPTPL YYGLGIEWSL LLPLMLVFMI TSLETIGDIT ATSDVSEQPV
SGPLYMKRLK GGVLANGLNS FVSAVFNTFP NSCFGQNNGV IQLTGVASRY VGFVVALMLI
VLGLFPAVSG FVQHIPEPVL GGATLVMFGT IAASGVRIVS REPLNRRAIL IIALSLAVGL
GVSQQPLILQ FAPEWLKNLL SSGIAAGGIT AIVLNLIFPP EKQ
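
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits, which does not matter here because the gene starts with ATG. The `translate` helper is illustrative, and only the first and last codons of the record are used as input.

```python
# Codon table in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence, copied from the record.
print(translate("ATGTCTGTTT CCACCCTCGA GTCAGAAAAT"))  # MSVSTLESEN
print(translate("GAAAAACAGT AA"))                     # EKQ
```

Both fragments match the start (MSVSTLESEN) and end (EKQ) of the protein sequence listed above; the final codon TAA is the stop codon and contributes no residue.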