Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0553 |
Symbol | |
ID | 6146612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 560822 |
End bp | 562276 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641615447 |
Product | allantoin permease |
Protein accession | YP_001742654 |
Protein GI | 170680776 |
COG category | [F] Nucleotide transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG1953] Cytosine/uracil/thiamine/allantoin permeases |
TIGRFAM ID | [TIGR00800] NCS1 nucleoside transporter family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.962113 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACATC AGAGAAAACT ATTCCAGCAA CGCGGCTATA GCGAAGATCT ATTGCCGAAA ACGCAAAGCC AGCGGACCTG GAAAACATTT AACTATTTTA CCTTATGGAT GGGTTCGGTT CATAACGTTC CCAATTATGT GATGGTCGGC GGCTTTTTTA TTCTCGGCTT GTCTACCTTT AGTATTATGC TGGCAATTAT CCTCAGCGCC TTTTTCATTG CCGCGGTAAT GGTATTAAAC GGTGCTGCGG GCAGTAAATA CGGTGTGCCT TTTGCCATGA TCCTGCGTGC TTCTTACGGC GTACGTGGTG CACTGTTTCC CGGATTATTA AGAGGCGGGA TTGCTGCCAT TATGTGGTTT GGCCTGCAAT GTTACGCTGG ATCGCTGGCC TGCTTGATCC TGATTGGCAA AATCTGGCCG GGATTTTTAA CTCTCGGTGG TGATTTTACC CTGTTAGGGC TTTCTCTACC GGGTTTAATC ACTTTCTTAC TCTTCTGGCT GGTGAACGTC GGAATCGGTT TCGGTGATGG CAAAGTTTTA AATAAATTCA CTGCCATTCT TAACCCGTGC ATCTATATCG TTTTCGGCGG TATGGCGATT TGGGCGATTT CGCTGGTCGG GATCGGTCCA ATCTTTGACT ATATTCCGGG CGGTATTCAG AAAGCAGAAA ACGGTGGCTT CCTGTTCCTG GTGGTGATTA ACGCGGTAGT TGCGGTCTGG GCGGCACCGG CGGTGAGCGC ATCCGACTTT ACGCAAAACG CCCACTCGTT TCGTGAGCAG GCGCTGGGGC AAACGCTGGG TTTAGTTGTG GCCTATATTC TGTTTGCGGT CGCCGGGGTA TGTATTATTG CCGGAGCCAG TATTCACTAC GGCGCGGACA CCTGGAACGT GCTGGATATT GTTCAGCGTT GGGACAGCCT GTTTGCCTCG TTCTTTGCGG TACTGGTTAT TCTGATGACG ACTATCTCCA CTAACGCCAC CGGTAATATT ATTCCGGCTG GTTATCAGAT AGCCGCTATT GCACCGACGA AACTGACCTA TAAAAACGGC GTACTGATTG CCAGTATTAT CAGCTTGCTG ATCTGCCCGT GGAAATTAAT GGAAAATCAG GACAGCATTT ATCTGTTCCT CGATATTATC GGCGGAATGC TGGGTCCGGT AATTGGTGTC ATGATGGCGC ATTATTTTGT GGTGATGCGC GGACAAATTA ATCTTGATGA ACTGTATACC GCAGCAGGGG ATTTCAAATA TTACGATAAC GGCTTTAACC TTACCGCGTT CTCAGTAACT TTGGTCGCCG TTATTTTATC TCTCGGTGGT AAATTTATTC CGTTCATGGA GCCTTTATCA CGCGTTTCAT GGTTTGTCGG CGTTATTGTC GCCTTTGCGG CCTACGCCTT ATTAAAGAAG CGTACTGCAG CAGAAAAAAC AGGAGAACAA AAAGTCACAG GTTAA
|
Protein sequence | MEHQRKLFQQ RGYSEDLLPK TQSQRTWKTF NYFTLWMGSV HNVPNYVMVG GFFILGLSTF SIMLAIILSA FFIAAVMVLN GAAGSKYGVP FAMILRASYG VRGALFPGLL RGGIAAIMWF GLQCYAGSLA CLILIGKIWP GFLTLGGDFT LLGLSLPGLI TFLLFWLVNV GIGFGDGKVL NKFTAILNPC IYIVFGGMAI WAISLVGIGP IFDYIPGGIQ KAENGGFLFL VVINAVVAVW AAPAVSASDF TQNAHSFREQ ALGQTLGLVV AYILFAVAGV CIIAGASIHY GADTWNVLDI VQRWDSLFAS FFAVLVILMT TISTNATGNI IPAGYQIAAI APTKLTYKNG VLIASIISLL ICPWKLMENQ DSIYLFLDII GGMLGPVIGV MMAHYFVVMR GQINLDELYT AAGDFKYYDN GFNLTAFSVT LVAVILSLGG KFIPFMEPLS RVSWFVGVIV AFAAYALLKK RTAAEKTGEQ KVTG
|
| |