Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0547 |
Symbol | |
ID | 5587684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 572588 |
End bp | 574042 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640924269 |
Product | allantoin permease |
Protein accession | YP_001461696 |
Protein GI | 157156624 |
COG category | [F] Nucleotide transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG1953] Cytosine/uracil/thiamine/allantoin permeases |
TIGRFAM ID | [TIGR00800] NCS1 nucleoside transporter family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACATC AGAGAAAACT ATTCCAGCAA CGCGGCTATA GCGAAGATCT ATTGCCGAAA ACGCAAAGCC AGCGGACCTG GAAAACATTT AACTATTTTA CCTTATGGAT GGGTTCGGTT CATAACGTTC CCAATTATGT GATGGTCGGC GGCTTTTTTA TTCTCGGCTT GTCTACCTTT AGTATTATGC TGGCAATTAT CCTCAGCGCC TTTTTCATTG CCGCGGTAAT GGTATTAAAC GGCGCTGCGG GCAGTAAATA CGGCGTACCG TTTGCCATGA TCCTGCGTGC TTCTTACGGC GTACGTGGCG CACTGTTTCC CGGATTATTA AGAGGCGGTA TTGCTGCCAT TATGTGGTTT GGCCTGCAAT GTTACGCGGG GTCACTGGCC TGCTTGATTC TGATTGGCAA AATCTGGCCG GGATTTTTAA CTCTCGGTGG TGATTTCACG CTGTTAGGGC TTTCTCTACC GGGCTTAATT ACTTTCTTAC TCTTCTGGCT GGTCAACGTT GGTATAGGTT TTGGCGGTGG CAAAGTTTTA AATAAATTCA CTGCCATTCT TAACCCGTGC ATCTATATCG TTTTCGGCGG CATGGCGATT TGGGCGATTT CACTGGTCGG GATCGGTCCA ATCTTTGACT ACATTCCGAG CGGTATTCAG AAAGCAGAAA ACAGTGGATT CTTGTTCCTG GTGGTGATTA ACGCGGTAGT TGCGGTCTGG GCGGCACCGG CGGTGAGCGC ATCCGACTTT ACGCAAAACG CCCACTCGTT TCGTGAGCAA GCGCTGGGGC AAACGCTGGG TTTAGTTGTG GCCTATATTC TGTTTGCGGT CGCTGGGGTA TGTATTATTG CCGGAGCCAG TATTCACTAC GGCGCTGACA CCTGGAACGT GCTGGATATT GTTCAGCGTT GGGACAGCCT GTTTGCCTCG TTCTTTGCGG TACTGGTTAT TCTGATGACG ACTATTTCCA CTAACGCCAC CGGTAATATT ATTCCGGCCG GTTATCAGAT TGCTGCCATT GCACCGACAA AACTGACCTA TAAAAACGGC GTACTGATTG CCAGTATTAT CAGCCTGCTG ATCTGCCCGT GGAAATTAAT GGAAAATCAG GACAGTATTT ATCTGTTCCT CGATATCATC GGCGGAATGC TGGGTCCGGT AATTGGTGTC ATGATGGCAC ATTATTTTGT GGTGATGCGC GGACAAATTA ATCTTGATGA ACTGTATACC GCACCTGGCG ATTATAAATA TTACGATAAC GGTTTTAACC TCACTGCGTT TTCAGTAACT CTGGTGGCCG TTATTTTATC TCTTGGCGGT AAGTTTATTC ACTTTATGGA ACCGTTATCG CGTGTTTCAT GGTTTGTCGG CGTCATCGTC GCCTTTGCGG CCTACGCCTT ATTAAAGAAA CGTACAACAG CAGAAAAAAC AGGAGAGCAA AAAACCATAG GTTAA
|
Protein sequence | MEHQRKLFQQ RGYSEDLLPK TQSQRTWKTF NYFTLWMGSV HNVPNYVMVG GFFILGLSTF SIMLAIILSA FFIAAVMVLN GAAGSKYGVP FAMILRASYG VRGALFPGLL RGGIAAIMWF GLQCYAGSLA CLILIGKIWP GFLTLGGDFT LLGLSLPGLI TFLLFWLVNV GIGFGGGKVL NKFTAILNPC IYIVFGGMAI WAISLVGIGP IFDYIPSGIQ KAENSGFLFL VVINAVVAVW AAPAVSASDF TQNAHSFREQ ALGQTLGLVV AYILFAVAGV CIIAGASIHY GADTWNVLDI VQRWDSLFAS FFAVLVILMT TISTNATGNI IPAGYQIAAI APTKLTYKNG VLIASIISLL ICPWKLMENQ DSIYLFLDII GGMLGPVIGV MMAHYFVVMR GQINLDELYT APGDYKYYDN GFNLTAFSVT LVAVILSLGG KFIHFMEPLS RVSWFVGVIV AFAAYALLKK RTTAEKTGEQ KTIG
|
| |