Gene ECH74115_0611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0611 
Symbol 
ID6967576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp633494 
End bp634885 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content48% 
IMG OID643384651 
Productallantoin permease 
Protein accessionYP_002269165 
Protein GI209399459 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACATC AGAGAAAACT ATTCCAGCAA CGCGGCTATA GCGAAGATCT ATTGCCGAAA 
ACGCAAAGCC AGCGGACCTG GAAAACATTT AACTATTTTA CCTTATGGAT GGGTTCGGTT
CATAACGTTC CCAATTATGT GATGGTCGGC GGCTTTTTTA TTCTCGGCTT GTCTACCTTT
AGTATTATGC TGGCAATTAT CCTCAGCGCC TTTTTCATTG CCGCGGTAAT GGTATTAAAC
GGCGCTGCGG GCAGTAAATA CGGCGTACCG TTTGCCATGA TCCTGCGTGC TTCTTACGGC
GTACGTGGCG CACTGTTTCC CGGATTATTA AGAGGCGGTA TTGCTGCCAT TATGTGGTTT
GGCCTGCAAT GTTACGCGGG GTCACTGGCC TGCTTGATTC TGATTGGCAA AATCTGGCCG
GGATTTTTGA CTCTCGGTGG TGATTTCACT CTGTTAGGCC TTTCTCTACC GGGCTTAATT
ACTTTCTTAC TCTTCTGGCT GGTCAACGTT GGTATAGGTT TTGGCGGTGG CAAAGTTTTA
AATAAATTCA CTGCCATTCT TAACCCGTGC ATCTATATCG TTTTCGGCGG TATGGCGATT
TGGGCGATTT CGCTGGTCGG GCTCGGTCCA ATCTTTGACT ACATTCCGAG CGGTATTCAG
AAAGCAGAAA ACAGTGGATT CTTGTTCCTG GTGGTGATTA ACGCGGTAGT TGCGGTCTGG
GCGGCACCGG CGGTGAGCGC ATCCGACTTT ACGCAAAACG CCCACTCGTT TCGTGAGCAA
GCGCTGGGGC AAACGCTGGG TTTAGTTGTG GCCTATATTC TGTTTGCGGT CGCGGGGGTA
TGTATTATTG CCGGAGCCAG TATTCATTAC GGCGCTGATA CCTGGAACGT GCTGGATATT
GTTCAGCGTT GGGACAGCCT GTTTGCCTCG TTCTTTGCGG TACTGGTTAT TCTGATGACG
ACTATCTCCA CTAACGCCAC TGGTAATATT ATTCCGGCTG GTTATCAGAT TGCTGCCATT
GCACCGACAA AACTGACCTA TAAAAACGGC GTACTGATTG CCAGTATTAT CAGCCTGCTG
ATCTGCCCGT GGAAATTAAT GGAAAATCAG GACAGTATTT ATCTGTTCCT CGATATTATC
GGCGGAATGC TTGGTCCGGT AATTGGTGTC ATGATGGCGC ATTATTTTGT GGTGATGCGT
GGGCAAATTA ATCTTGATGA ACTGTACACC GCACCTGGCG ATTATAAATA TTACGATAAC
GGTTTTAACC TCACCGCGTT TTCAGTAACT CTGGTGGCCG TTATTTTATC TCTTGGCGGT
AAGTTTATTC CCTTAATGGA ACCGTTATCG CGTGTTTCAT GGTTTGTCGG CGTCATCGTC
GCCTTTGCTT GA
 
Protein sequence
MEHQRKLFQQ RGYSEDLLPK TQSQRTWKTF NYFTLWMGSV HNVPNYVMVG GFFILGLSTF 
SIMLAIILSA FFIAAVMVLN GAAGSKYGVP FAMILRASYG VRGALFPGLL RGGIAAIMWF
GLQCYAGSLA CLILIGKIWP GFLTLGGDFT LLGLSLPGLI TFLLFWLVNV GIGFGGGKVL
NKFTAILNPC IYIVFGGMAI WAISLVGLGP IFDYIPSGIQ KAENSGFLFL VVINAVVAVW
AAPAVSASDF TQNAHSFREQ ALGQTLGLVV AYILFAVAGV CIIAGASIHY GADTWNVLDI
VQRWDSLFAS FFAVLVILMT TISTNATGNI IPAGYQIAAI APTKLTYKNG VLIASIISLL
ICPWKLMENQ DSIYLFLDII GGMLGPVIGV MMAHYFVVMR GQINLDELYT APGDYKYYDN
GFNLTAFSVT LVAVILSLGG KFIPLMEPLS RVSWFVGVIV AFA