Gene ECH74115_4172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4172 
Symbol 
ID6970237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3866898 
End bp3868298 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content49% 
IMG OID643387918 
Productxanthine/uracil permease family protein 
Protein accessionYP_002272357 
Protein GI209400599 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2233] Xanthine/uracil permeases 
TIGRFAM ID[TIGR00801] uracil-xanthine permease
[TIGR03173] xanthine permease 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATA TAAACCATGC AGGTTCTGAC CTTATATTTG AACTGGAGGA TCGCCCTCCC 
TTTCATCAGG CTCTCGTTGG TGCCATTACC CATCTGTTGG CAATTTTCGT TCCGATGGTA
ACCCCCGCGT TAATCGTGGG GGCGGCCTTA CAGCTTTCCG CTGAAACAAC TGCCTATCTT
GTTTCAATGG CGATGATCGC CTCTGGTATT GGTACCTGGT TACAAGTAAA CCGCTATGGC
ATCGTCGGTT CTGGCCTACT CTCAATTCAG TCAGTCAATT TTTCATTTGT TACGGTCATG
ATTGCGCTGG GCAGCAGCAT GAAAAGCGAC GGTTTTCACG AAGAGTTAAT CATGTCGTCG
CTCCTCGGCG TCTCCTTCGT TGGCGCATTT CTGGTTGTCG GATCTTCATT TATCTTGCCC
TATTTACGTC GGGTTATTAC GCCTACTGTC AGCGGTATTG TGGTACTGAT GATCGGCTTA
AGCCTGATTA AAGTCGGCAT TATCGATTTT GGTGGAGGAT TTGCTGCCAA AAGCAGCGGT
ACCTTCGGCA ATTACGAACA TCTCGGCGTT GGTTTATTGG TTTTGATTGT GGTGATCGGC
TTTAACTGCT GTCGCAGTCC GTTGCTACGC ATGGGAGGGA TCGCCATTGG GCTATGCGTC
GGTTATATCG CGTCGTTATG CCTGGGCATG GTGGATTTCA GCAGTATGCG CAATTTGCCG
TTAATCACCA TCCCACATCC GTTCAAATAC GGCTTTAGTT TTAGCTTCCA TCAGTTCCTG
GTGGTTGGCA CGATTTATCT GCTTAGCGTG CTGGAAGCTG TCGGCGATAT CACCGCCACG
GCAATGGTTT CCCGCCGACC CATTCAGGGA GAAGAGTATC AGTCCCGACT GAAAGGCGGC
GTGCTGGCAG ACGGTCTGGT TTCTGTTATC GCCTCCGCTG TCGGGTCATT ACCCTTAACC
ACGTTTGCGC AAAATAATGG GGTTATTCAG ATGACTGGCG TCGCTTCACG TTATGTCGGG
CGAACCATCG CGGTAATGCT GGTTATCCTC GGTTTGTTTC CGATGATTGG CGGCTTCTTC
ACGACCATTC CCTCGGCAGT TCTGGGAGGC GCAATGACGT TGATGTTTTC CATGATTGCC
ATCGCTGGGA TTCGCATCAT CATCACCAAC GGTTTAAAGC GCCGAGAAAC ACTTATTGTC
GCCACTTCTT TAGGTTTAGG ACTTGGCGTC TCCTACGATC CCGAAATTTT TAAAATATTG
CCAGCCTCTA TTTATGTATT AGTTGAAAAC CCTATTTGTG CTGGCGGGTT AACTGCGATT
TTATTAAATA TTATCCTCCC TGGTGGCTAC CGACAGGAAA ACGTTCTGCC TGGTATTACC
TCAGCGGAAG AGATGGATTA A
 
Protein sequence
MSDINHAGSD LIFELEDRPP FHQALVGAIT HLLAIFVPMV TPALIVGAAL QLSAETTAYL 
VSMAMIASGI GTWLQVNRYG IVGSGLLSIQ SVNFSFVTVM IALGSSMKSD GFHEELIMSS
LLGVSFVGAF LVVGSSFILP YLRRVITPTV SGIVVLMIGL SLIKVGIIDF GGGFAAKSSG
TFGNYEHLGV GLLVLIVVIG FNCCRSPLLR MGGIAIGLCV GYIASLCLGM VDFSSMRNLP
LITIPHPFKY GFSFSFHQFL VVGTIYLLSV LEAVGDITAT AMVSRRPIQG EEYQSRLKGG
VLADGLVSVI ASAVGSLPLT TFAQNNGVIQ MTGVASRYVG RTIAVMLVIL GLFPMIGGFF
TTIPSAVLGG AMTLMFSMIA IAGIRIIITN GLKRRETLIV ATSLGLGLGV SYDPEIFKIL
PASIYVLVEN PICAGGLTAI LLNIILPGGY RQENVLPGIT SAEEMD