Gene Amuc_1847 details


Gene Information

Locus tag: Amuc_1847
Symbol:
ID: 6274748
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2244286
End bp: 2245605
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 55%
IMG OID: 642613908
Product: deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_001878443
Protein GI: 187736331
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
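
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon (neither is stated explicitly in the record). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2244286, 2245605)
print(length)                  # 1320
print(protein_length(length))  # 439
```

Under these assumptions the computed values agree with the Gene Length (1320 bp) and Protein Length (439 aa) fields.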


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.809264
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAATT GGCAGCAGCT TTTGTCGGAC GCGCGCTGGG GATCCAGGCA TCGGAGTACG 
GCATCTCCAC AGAAAAACCG CAGCAATTAT GACCGGGATT ACGGCCGCAT TCTGTTTTCC
AGTGCTTTCC GCCGCCTTCA GGACAAAACA CAGGTTTTTC CCCTGGGCCG CAACGATTAC
GTGCGCACAC GCCTGACGCA TAGCCTGGAG GTAGCCCACA TCGGTTCTTC CCTCGGCATG
CTGGCAGGGG AGTGGCTGGC GAAGAAAGGG TCCCTGCCCG CTCCTATCAT GCCTTCCGAT
GTCGCTACCA TTGTTTCCAC CGCCTGCCTG GCCCATGACA TCGGCAATCC GCCCTTCGGC
CATTCCGGTG AAGACGCCAT TGAAACCGCC CTGAAACGGC ACGGCCTGGA GCTGCCTTTT
GAAGGGAATG CCCAGGGATT CCGCATCCTG ACACGTACAG GAGATCCGAT GGAGGGCAAC
GGCCTGAAAC TGACGGCGGC TGTTTTGGGC GCCTTTATGA AATATCCCTG TACGCAATCT
TATTCCTCCG CCGTCAGGAA AGGCAGCATA GTTGCGGCCA ACAAGCTTGA ATGCAAGAAA
TTCGGCATCG GGGAACAGGA AAGGGAAGCT GCTTCTTTCA TGGCTGGACA TCTGGGGTTG
ATTCCGCGTT CCACACCTGA TGCAGCACAT CTTTGCTGGA GCCGCCATCC CCTGGCGTAT
TTGATGGAAG CCGCGGATGA TATCTGTTAC CGCATTGCGG ATATTGAGGA CGGGTATTTT
TCCGGACTTC TGGATTTTGC TTCCACCAGG GATTTGTTCA GCCCTTTCCT GACGGAATCC
CAGCTCTGTT ATGTCCGGGA ACTGGAAGCA AAAGAGGAAA AAGATTCCTG CATCCATTAT
ATGCGCGCGC TCGCCATCGG CAAAAGCATT CAGTCCGCCG TGGACAGTTT CGTGAACCAC
GAGGAAGACC TGCTGCAAGG AAGGTTGGAA CAATCCCTGA TAGACAGCTC GGAACTGGCC
GCCCCGCTGA ACGGCTTGTA CCAGTACGCC ATCAAGAATG TGTACCAGGC CAGGGAGGTT
ATTGAAGTGG AAGCCATGGG TTATAAGGTA CTGGGAGAAT TGATCGACTT CTTTATGGAA
TGGGTGAATC ATCCATCCTC CGGCCAATCC CAAAAAATCG CCATCATGCT TCAGGGCACC
GGCGTACCCC GGAACAACGG CGGGAAGGCT GCACGCCTGG AGCACATGCT TGATTATATT
TCCGGAATGA CGGATTCTTT TGCCCTGGAG ACTTACCGGA AGTTGACCGG AATTCTGTAA
 
Protein sequence
MMNWQQLLSD ARWGSRHRST ASPQKNRSNY DRDYGRILFS SAFRRLQDKT QVFPLGRNDY 
VRTRLTHSLE VAHIGSSLGM LAGEWLAKKG SLPAPIMPSD VATIVSTACL AHDIGNPPFG
HSGEDAIETA LKRHGLELPF EGNAQGFRIL TRTGDPMEGN GLKLTAAVLG AFMKYPCTQS
YSSAVRKGSI VAANKLECKK FGIGEQEREA ASFMAGHLGL IPRSTPDAAH LCWSRHPLAY
LMEAADDICY RIADIEDGYF SGLLDFASTR DLFSPFLTES QLCYVRELEA KEEKDSCIHY
MRALAIGKSI QSAVDSFVNH EEDLLQGRLE QSLIDSSELA APLNGLYQYA IKNVYQAREV
IEVEAMGYKV LGELIDFFME WVNHPSSGQS QKIAIMLQGT GVPRNNGGKA ARLEHMLDYI
SGMTDSFALE TYRKLTGIL
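
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any computation such as length or GC content. The following is a rough Python sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied with its block spacing.
first_line = "ATGATGAATT GGCAGCAGCT TTTGTCGGAC GCGCGCTGGG GATCCAGGCA TCGGAGTACG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```

Passing the full gene sequence through the same two helpers should give a length that can be compared with the Gene Length field and a GC fraction that can be compared with the reported GC content.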