Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0611 |
Symbol | |
ID | 6967576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 633494 |
End bp | 634885 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643384651 |
Product | allantoin permease |
Protein accession | YP_002269165 |
Protein GI | 209399459 |
COG category | [F] Nucleotide transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG1953] Cytosine/uracil/thiamine/allantoin permeases |
TIGRFAM ID | [TIGR00800] NCS1 nucleoside transporter family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACATC AGAGAAAACT ATTCCAGCAA CGCGGCTATA GCGAAGATCT ATTGCCGAAA ACGCAAAGCC AGCGGACCTG GAAAACATTT AACTATTTTA CCTTATGGAT GGGTTCGGTT CATAACGTTC CCAATTATGT GATGGTCGGC GGCTTTTTTA TTCTCGGCTT GTCTACCTTT AGTATTATGC TGGCAATTAT CCTCAGCGCC TTTTTCATTG CCGCGGTAAT GGTATTAAAC GGCGCTGCGG GCAGTAAATA CGGCGTACCG TTTGCCATGA TCCTGCGTGC TTCTTACGGC GTACGTGGCG CACTGTTTCC CGGATTATTA AGAGGCGGTA TTGCTGCCAT TATGTGGTTT GGCCTGCAAT GTTACGCGGG GTCACTGGCC TGCTTGATTC TGATTGGCAA AATCTGGCCG GGATTTTTGA CTCTCGGTGG TGATTTCACT CTGTTAGGCC TTTCTCTACC GGGCTTAATT ACTTTCTTAC TCTTCTGGCT GGTCAACGTT GGTATAGGTT TTGGCGGTGG CAAAGTTTTA AATAAATTCA CTGCCATTCT TAACCCGTGC ATCTATATCG TTTTCGGCGG TATGGCGATT TGGGCGATTT CGCTGGTCGG GCTCGGTCCA ATCTTTGACT ACATTCCGAG CGGTATTCAG AAAGCAGAAA ACAGTGGATT CTTGTTCCTG GTGGTGATTA ACGCGGTAGT TGCGGTCTGG GCGGCACCGG CGGTGAGCGC ATCCGACTTT ACGCAAAACG CCCACTCGTT TCGTGAGCAA GCGCTGGGGC AAACGCTGGG TTTAGTTGTG GCCTATATTC TGTTTGCGGT CGCGGGGGTA TGTATTATTG CCGGAGCCAG TATTCATTAC GGCGCTGATA CCTGGAACGT GCTGGATATT GTTCAGCGTT GGGACAGCCT GTTTGCCTCG TTCTTTGCGG TACTGGTTAT TCTGATGACG ACTATCTCCA CTAACGCCAC TGGTAATATT ATTCCGGCTG GTTATCAGAT TGCTGCCATT GCACCGACAA AACTGACCTA TAAAAACGGC GTACTGATTG CCAGTATTAT CAGCCTGCTG ATCTGCCCGT GGAAATTAAT GGAAAATCAG GACAGTATTT ATCTGTTCCT CGATATTATC GGCGGAATGC TTGGTCCGGT AATTGGTGTC ATGATGGCGC ATTATTTTGT GGTGATGCGT GGGCAAATTA ATCTTGATGA ACTGTACACC GCACCTGGCG ATTATAAATA TTACGATAAC GGTTTTAACC TCACCGCGTT TTCAGTAACT CTGGTGGCCG TTATTTTATC TCTTGGCGGT AAGTTTATTC CCTTAATGGA ACCGTTATCG CGTGTTTCAT GGTTTGTCGG CGTCATCGTC GCCTTTGCTT GA
|
Protein sequence | MEHQRKLFQQ RGYSEDLLPK TQSQRTWKTF NYFTLWMGSV HNVPNYVMVG GFFILGLSTF SIMLAIILSA FFIAAVMVLN GAAGSKYGVP FAMILRASYG VRGALFPGLL RGGIAAIMWF GLQCYAGSLA CLILIGKIWP GFLTLGGDFT LLGLSLPGLI TFLLFWLVNV GIGFGGGKVL NKFTAILNPC IYIVFGGMAI WAISLVGLGP IFDYIPSGIQ KAENSGFLFL VVINAVVAVW AAPAVSASDF TQNAHSFREQ ALGQTLGLVV AYILFAVAGV CIIAGASIHY GADTWNVLDI VQRWDSLFAS FFAVLVILMT TISTNATGNI IPAGYQIAAI APTKLTYKNG VLIASIISLL ICPWKLMENQ DSIYLFLDII GGMLGPVIGV MMAHYFVVMR GQINLDELYT APGDYKYYDN GFNLTAFSVT LVAVILSLGG KFIPLMEPLS RVSWFVGVIV AFA
|
| |