Gene Amuc_1518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1518 
Symbol 
ID6274622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1812644 
End bp1813654 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content58% 
IMG OID642613577 
ProductEndonuclease/exonuclease/phosphatase 
Protein accessionYP_001878120 
Protein GI187736008 
COG category[R] General function prediction only 
COG ID[COG3568] Metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000454364 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00000000880332 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCAGGA AAACGTCAAA ACGCAGGGGA ACCTCCGTCA TTGCGGCCCT GATTGCGTTG 
TGCGCGGTGG TGGGGTACGG TTTGACGGAG TGGGCGCCGC TGGATAATGA ACCGCAGGCG
GTTCCCTCCC GCCAGAGGGA AGAGCGGCAG ACGGTTCGGG AGAAGGTGGA AATCCCCGAC
AAGGGGGAGC CCGTTCGTTT GCTGACCATG AACGCCGGAA ACTACTTTGT GCCGGAAGAC
CCGAGGAGAA GCAATTTTCA GGTAAAATAC AAGCCTGTGG AAGCCCGTGA AGCCGTGGCG
GAGCTGGTTC GCCAATCGGG GGCGGAAATC GTGGGGCTGT GTGAAATGGG CGGGGAGGCT
GCCGTTCGTG ACTTGCAAAT GCGGCTGAAA AGAAAAGGAG TTCATTTGCC GTACAAAGTT
CTTGTCATGC GGGACGGGGA GGACCGTGGT TTGGCCCTTC TTTCCAAATA CCGCATCGCG
GATGACCGTT CCGTAACGGA CATGCCTGTA TCCGGAGAGG CGAAACGGAA AAAGACGATG
CTGCGGGGCA TTCTGGACGC CACGGTCAGC ATGCCGGACG GACGGCTGTT CCGCCTGGTG
GGCATTCATC TGAAATCACG CCTCAGCCGT GACGGTTCCG CAGAAGACAC ACGGAGAAGG
GAAGCCTACG CCCTGCGGGA CTACCTGAAT GAAGCTCTTG CCTCTCAGGA CGGCATGCCT
CTGCTTCTGT ACGGAGATTT TAATGACGGC CCGTCAGACA GCGCCGTGCA GGTCATCCAG
GGGCCGGCCA AAACGGAATA CCGCCTGAAC CGTTTGAAGC CCAGGGATTC TCGTGGTGAG
ACCTGGACCA TTTACTACGA AGACGGTGAC ACCTACCATT CCTTCGACCA TCTTTTCCTG
AACAATACTC TGAAAAAGCG CCTCGGCCGC AAGCCTCCCA TGGGCATCCT TGACTCTCCC
CCCTCGCTCC AGGCCAGCGA CCACCGCGGC GTGTGGGTGG AATTAAGGTA G
 
Protein sequence
MIRKTSKRRG TSVIAALIAL CAVVGYGLTE WAPLDNEPQA VPSRQREERQ TVREKVEIPD 
KGEPVRLLTM NAGNYFVPED PRRSNFQVKY KPVEAREAVA ELVRQSGAEI VGLCEMGGEA
AVRDLQMRLK RKGVHLPYKV LVMRDGEDRG LALLSKYRIA DDRSVTDMPV SGEAKRKKTM
LRGILDATVS MPDGRLFRLV GIHLKSRLSR DGSAEDTRRR EAYALRDYLN EALASQDGMP
LLLYGDFNDG PSDSAVQVIQ GPAKTEYRLN RLKPRDSRGE TWTIYYEDGD TYHSFDHLFL
NNTLKKRLGR KPPMGILDSP PSLQASDHRG VWVELR