Gene Information |
Locus tag | EcSMS35_1143 |
Symbol | amn |
ID | 6143875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1160451 |
End bp | 1161905 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641616021 |
Product | AMP nucleosidase |
Protein accession | YP_001743210 |
Protein GI | 170679849 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0775] Nucleoside phosphorylase |
TIGRFAM ID | [TIGR01717] AMP nucleosidase |
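The length fields above follow from the coordinates by simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS includes its stop codon. The record's own numbers are consistent with both assumptions, but the record does not state them. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop else codons


length_bp = gene_length(1160451, 1161905)  # 1455, matching "Gene Length"
length_aa = protein_length(length_bp)      # 484, matching "Protein Length"
print(length_bp, length_aa)
```

The strand does not affect this calculation: a minus-strand gene spans the same coordinates, and only the direction in which it is read changes.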
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000908375 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.00000131711 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATAATA AGGGCTCCGG TCTGACCCCA GCTCAGGCAC TGGATAAACT CGACGCGCTG TATGAGCAAT CTGTAGTCGC ATTACGCAAC GCCATTGGCA ACTATATTAC AAGTGGCGAA TTACCTGATG AAAACGCCCG CAAACAAGGT CTTTTTGTCT ATCCATCACT GACCGTAACC TGGGACGGTA GCACAACCAA TCCCCCCAAA ACGCGCGCAT TTGGTCGTTT TACTCACGCA GGCAGCTACA CCACCACGAT TACTCGCCCT ACTCTCTTTC GTTCGTATCT TAATGAACAA CTTACGTTGC TGTATCAGGA TTATGGTGCG CATATCTCTG TGCAACCCTC GCAGCATGAA ATCCCTTATC CTTATGTCAT CGATGGCTCT GAATTGACAC TTGATCGCTC AATGAGCGCT GGGTTAACTC GCTACTTCCC GACAACAGAA CTGGCGCAAA TTGGCGATGA AACTGCAGAC GGCATTTATC ATCCAACTGA ATTCTCCCCC CTATCGCATT TTGATGCGCG CCGCGTCGAT TTTTCCCTCG CACGGTTGCG CCATTATACC GGTACGCCAG TTGAACATTT TCAGCCGTTC GTCTTGTTTA CCAACTACAC ACGTTATGTG GATGAATTCG TTCGTTGGGG ATGCAGCCAG ATCCTCGATC CTGATAGTCC CTACATTGCC CTTTCTTGTG CTGGCGGGAA CTGGATCACC GCCGAAACCG AAGCGCCAGA AGAAGCCATT TCCGACCTTG CATGGAAAAA ACATCAGATG CCAGCATGGC ATTTAATTAC CGCCGATGGT CAGGGTATTA CTCTGGTGAA TATTGGCGTG GGACCGTCAA ATGCTAAAAC CATCTGCGAT CATCTGGCAG TGCTACGCCC GGATGTCTGG TTGATGATTG GTCACTGTGG CGGATTACGT GAAAGTCAGG CCATTGGCGA TTATGTACTT GCACACGCTT ATTTACGCGA TGACCACGTT CTTGATGCGG TTCTGCCGCC CGATATTCCT ATTCCGAGCA TTGCTGAAGT GCAACGTGCG CTTTATGACG CCACCAAGCT GGTGAGTGGC AGGCCCGGTG AGGAAGTCAA ACAGCGGCTA CGTACTGGTA CTGTGGTAAC CACAGATGAC AGGAACTGGG AATTACGTTA CTCAGCTTCT GCACTTCGTT TTAACTTAAG CCGGGCCGTA GCAATTGATA TGGAAAGTGC AACCATTGCC GCGCAAGGAT ATCGTTTCCG CGTGCCATAC GGGACACTAC TGTGTGTTTC AGATAAACCG TTGCATGGCG AGATTAAACT TCCCGGTCAG GCTAACCGTT TTTATGAAGG CGCTATTTCC GAACACCTAC AAATTGGCAT TCGGGCGATC GATTTGCTGC GCGCAGAAGG CGACCGACTG CATTCACGTA AATTACGAAC CTTTAATGAG CCGCCGTTCC GATAA
|
Protein sequence | MNNKGSGLTP AQALDKLDAL YEQSVVALRN AIGNYITSGE LPDENARKQG LFVYPSLTVT WDGSTTNPPK TRAFGRFTHA GSYTTTITRP TLFRSYLNEQ LTLLYQDYGA HISVQPSQHE IPYPYVIDGS ELTLDRSMSA GLTRYFPTTE LAQIGDETAD GIYHPTEFSP LSHFDARRVD FSLARLRHYT GTPVEHFQPF VLFTNYTRYV DEFVRWGCSQ ILDPDSPYIA LSCAGGNWIT AETEAPEEAI SDLAWKKHQM PAWHLITADG QGITLVNIGV GPSNAKTICD HLAVLRPDVW LMIGHCGGLR ESQAIGDYVL AHAYLRDDHV LDAVLPPDIP IPSIAEVQRA LYDATKLVSG RPGEEVKQRL RTGTVVTTDD RNWELRYSAS ALRFNLSRAV AIDMESATIA AQGYRFRVPY GTLLCVSDKP LHGEIKLPGQ ANRFYEGAIS EHLQIGIRAI DLLRAEGDRL HSRKLRTFNE PPFR
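The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is given 5' to 3' in coding orientation (it begins with ATG and ends with TAA), and it uses the standard codon assignments, which translation table 11 shares for elongation. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 nucleotides of the record are used as sample input.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt of the gene sequence, copied from the record.
prefix = "ATGAATAATA AGGGCTCCGG TCTGACCCCA"
print(translate(prefix))    # MNNKGSGLTP, the first ten residues of the protein
print(gc_fraction(prefix))  # 0.5 for this fragment
```

Pasting the full gene sequence in place of `prefix` allows the same comparison against the complete protein sequence and the reported GC content.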