Gene Information |
Locus tag | EcHS_A2105 |
Symbol | amn |
ID | 5594450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2093664 |
End bp | 2095118 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640921244 |
Product | AMP nucleosidase |
Protein accession | YP_001458786 |
Protein GI | 157161468 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0775] Nucleoside phosphorylase |
TIGRFAM ID | [TIGR01717] AMP nucleosidase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000000000125567 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAATA AGGGCTCCGG TCTGACCCCA GCTCAGGCAC TGGATAAACT CGACGCGCTG TATGAGCAAT CTGTAGTCGC ATTACGCAAC GCCATTGGCA ACTATATTAC CAGTGGCGAA TTACCTGATG AAAACGCCCG CAAACAAGGT CTTTTTGTCT ATCCATCACT GACCGTAACC TGGGACGGTA GCACAACCAA TCCCCCCAAA ACGCGCGCAT TTGGTCGTTT TACTCACGCA GGCAGCTACA CCACCACGAT TACTCGCCCT ACTCTCTTTC GTTCGTATCT TAATGAACAA CTTACGTTGC TGTATCAGGA TTATGGTGCG CATATCTCTG TGCAACCCTC GCAGCATGAA ATCCCTTATC CTTATGTCAT CGATGGCTCT GAATTGACAC TTGATCGCTC AATGAGCGCT GGGTTAACTC GCTACTTCCC GACAACAGAA CTGGCGCAAA TTGGCGATGA AACTGCAGAC GGCATTTATC ATCCAACGGA ATTCTCCCCG CTATCGCATT TTGATGCGCG CCGCGTCGAT TTTTCCCTCG CACGGTTGCG CCATTATACC GGTACGCCAG TTGAACATTT TCAGCCGTTC GTCTTGTTTA CCAACTACAC ACGTTATGTG GATGAATTCG TTCGTTGGGG ATGCAGCCAG ATCCTCGATC CTGATAGTCC TTACATTGCC CTTTCTTGTG CTGGCGGGAA CTGGATCACC GCCGAAACCG AAGCGCCAGA AGAAGCCATT TCCGACCTTG CATGGAAAAA ACACCAGATG CCAGCATGGC ATTTAATTAC CGCCGATGCT CAGGGTATTA CTCTGGTGAA TATTGGCGTG GGACCGTCCA ATGCTAAAAC CATCTGCGAT CATCTGGCAG TGCTACGCCC GGATGTCTGG TTGATGATTG GTCACTGTGG CGGATTACGT GAAAGTCAGG CCATTGGCGA TTATGTACTT GCACACGCTT ATTTACGCGA TGACCACGTT CTTGATGCGG TTCTGCCGCC CGATATTCCT ATTCCGAGCA TTGCTGAAGT GCAACGTGCG CTTTATGACG CCACCAAGCT GGTGAGTGGC AGGCCCGGTG AGGAAGTCAA ACAGCGGCTA CGTACAGGTA CTGTGGTAAC CACAGATGAC AGGAACTGGG AATTACGTTA CTCAGCTTCT GCACTTCGTT TTAACTTAAG CCGGGCCGTA GCAATTGATA TGGAAAGTGC AACCATTGCC GCGCAAGGAT ATCGTTTCCG CGTGCCATAC GGGACACTAC TGTGTGTTTC AGATAAACCG TTGCATGGCG AGATTAAACT TCCCGGTCAG GCTAACCGTT TTTATGAAGG TGCTATTTCC GAACACCTGC AAATTGGCAT TCGGGCGATC GATTTGCTGC GCGCAGAAGG CGACCGACTG CATTCGCGTA AATTACGAAC CTTTAATGAG CCGCCGTTCC GATAA
Protein sequence | MNNKGSGLTP AQALDKLDAL YEQSVVALRN AIGNYITSGE LPDENARKQG LFVYPSLTVT WDGSTTNPPK TRAFGRFTHA GSYTTTITRP TLFRSYLNEQ LTLLYQDYGA HISVQPSQHE IPYPYVIDGS ELTLDRSMSA GLTRYFPTTE LAQIGDETAD GIYHPTEFSP LSHFDARRVD FSLARLRHYT GTPVEHFQPF VLFTNYTRYV DEFVRWGCSQ ILDPDSPYIA LSCAGGNWIT AETEAPEEAI SDLAWKKHQM PAWHLITADA QGITLVNIGV GPSNAKTICD HLAVLRPDVW LMIGHCGGLR ESQAIGDYVL AHAYLRDDHV LDAVLPPDIP IPSIAEVQRA LYDATKLVSG RPGEEVKQRL RTGTVVTTDD RNWELRYSAS ALRFNLSRAV AIDMESATIA AQGYRFRVPY GTLLCVSDKP LHGEIKLPGQ ANRFYEGAIS EHLQIGIRAI DLLRAEGDRL HSRKLRTFNE PPFR