Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1847 |
Symbol | |
ID | 6274748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2244286 |
End bp | 2245605 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642613908 |
Product | deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_001878443 |
Protein GI | 187736331 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.809264 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATT GGCAGCAGCT TTTGTCGGAC GCGCGCTGGG GATCCAGGCA TCGGAGTACG GCATCTCCAC AGAAAAACCG CAGCAATTAT GACCGGGATT ACGGCCGCAT TCTGTTTTCC AGTGCTTTCC GCCGCCTTCA GGACAAAACA CAGGTTTTTC CCCTGGGCCG CAACGATTAC GTGCGCACAC GCCTGACGCA TAGCCTGGAG GTAGCCCACA TCGGTTCTTC CCTCGGCATG CTGGCAGGGG AGTGGCTGGC GAAGAAAGGG TCCCTGCCCG CTCCTATCAT GCCTTCCGAT GTCGCTACCA TTGTTTCCAC CGCCTGCCTG GCCCATGACA TCGGCAATCC GCCCTTCGGC CATTCCGGTG AAGACGCCAT TGAAACCGCC CTGAAACGGC ACGGCCTGGA GCTGCCTTTT GAAGGGAATG CCCAGGGATT CCGCATCCTG ACACGTACAG GAGATCCGAT GGAGGGCAAC GGCCTGAAAC TGACGGCGGC TGTTTTGGGC GCCTTTATGA AATATCCCTG TACGCAATCT TATTCCTCCG CCGTCAGGAA AGGCAGCATA GTTGCGGCCA ACAAGCTTGA ATGCAAGAAA TTCGGCATCG GGGAACAGGA AAGGGAAGCT GCTTCTTTCA TGGCTGGACA TCTGGGGTTG ATTCCGCGTT CCACACCTGA TGCAGCACAT CTTTGCTGGA GCCGCCATCC CCTGGCGTAT TTGATGGAAG CCGCGGATGA TATCTGTTAC CGCATTGCGG ATATTGAGGA CGGGTATTTT TCCGGACTTC TGGATTTTGC TTCCACCAGG GATTTGTTCA GCCCTTTCCT GACGGAATCC CAGCTCTGTT ATGTCCGGGA ACTGGAAGCA AAAGAGGAAA AAGATTCCTG CATCCATTAT ATGCGCGCGC TCGCCATCGG CAAAAGCATT CAGTCCGCCG TGGACAGTTT CGTGAACCAC GAGGAAGACC TGCTGCAAGG AAGGTTGGAA CAATCCCTGA TAGACAGCTC GGAACTGGCC GCCCCGCTGA ACGGCTTGTA CCAGTACGCC ATCAAGAATG TGTACCAGGC CAGGGAGGTT ATTGAAGTGG AAGCCATGGG TTATAAGGTA CTGGGAGAAT TGATCGACTT CTTTATGGAA TGGGTGAATC ATCCATCCTC CGGCCAATCC CAAAAAATCG CCATCATGCT TCAGGGCACC GGCGTACCCC GGAACAACGG CGGGAAGGCT GCACGCCTGG AGCACATGCT TGATTATATT TCCGGAATGA CGGATTCTTT TGCCCTGGAG ACTTACCGGA AGTTGACCGG AATTCTGTAA
|
Protein sequence | MMNWQQLLSD ARWGSRHRST ASPQKNRSNY DRDYGRILFS SAFRRLQDKT QVFPLGRNDY VRTRLTHSLE VAHIGSSLGM LAGEWLAKKG SLPAPIMPSD VATIVSTACL AHDIGNPPFG HSGEDAIETA LKRHGLELPF EGNAQGFRIL TRTGDPMEGN GLKLTAAVLG AFMKYPCTQS YSSAVRKGSI VAANKLECKK FGIGEQEREA ASFMAGHLGL IPRSTPDAAH LCWSRHPLAY LMEAADDICY RIADIEDGYF SGLLDFASTR DLFSPFLTES QLCYVRELEA KEEKDSCIHY MRALAIGKSI QSAVDSFVNH EEDLLQGRLE QSLIDSSELA APLNGLYQYA IKNVYQAREV IEVEAMGYKV LGELIDFFME WVNHPSSGQS QKIAIMLQGT GVPRNNGGKA ARLEHMLDYI SGMTDSFALE TYRKLTGIL
|
| |