Gene Amuc_2025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2025 
Symbol 
ID6274679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2458357 
End bp2459559 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content58% 
IMG OID642614085 
ProductDEAD/DEAH box helicase domain protein 
Protein accessionYP_001878616 
Protein GI187736504 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.000222317 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTATTTT CAGAATTAGG TTTATCGGAA CCCGTCTTGA AGGCGGTGGA GAAATGCGGT 
TATGAACATC CCACCCCCAT TCAGGAGCAG GCCATTCCCA TCATTCTGGA AGGCAGGGAC
CTCATTGGGG CCTCCCAGAC GGGGACGGGG AAAACCGCTG CTTTCGCCCT CCCGCTGCTG
ACAAGGATTC AGCCCATCGG CAAACCTCAG ATACTGGTGC TGGAACCCAC CAGGGAACTG
GCCGACCAGG TGGCGGAATC CTTTGCCGAA TACGGTGAAT TCACCGGGTT GAAAGTAGCG
TTGCTGTATG GCGGCGTGGG GTACGGAAAG CAGACGGAAG ACCTGAAAAA AGGGGCGGAC
ATCGTTGTGG CCACTCCCGG CCGGCTGGTG GACCACTTCT ACCGCTGCAC CATGCGCTTC
GGAGAAGTCA AGGCCCTGGT TCTGGATGAA GTGGACCGAA TGCTGGACAT GGGGTTCCTG
CCCATTGTCC GTAAAATCGT CAACCTTTGT CCGTGGGAAG GAAGGCAAAC CCTCTTCTTC
TCCGCCACCA TGCCTCCGGT CATCGCGGGA TTTGCCAAAT GGTGCCTGAC GGACCCTGCG
GAAGTTACCA TCGCCCGGCG TGAAGTGGCC GCCACCATCA GCCATGCCTT TTATCCGGTA
GCTCTGGACC AGCGGGATGA ACTGCTGTTG GCCCTGCTCA AGGGGACGGA CTTCCGTTCC
GTCATGATTT TCACCCGCAC CCGCAAGGAG GCGGACGCGG TATGCGGCAT GCTCAAGCAT
CATGGCTACC GCGGGGAGGT GGCCGTCATG CACTCCGACA TTCCCCAGAA GGAACGCATG
GAGGCGCTTA AGGGATTCAA GAGCGGAAAA TATGATATTC TGGTGGCTAC GGATGTGGCG
GCGCGCGGCA TTGACATCAG CGGTGTGACC CACGTCATCA ACTACCGCGT TCCGGAAAAC
GCGGAAGACT ATGTGCACCG CATCGGCCGT ACCGGCCGCG CGGAAGCTTC CGGGGATGCG
TTCACGATCA TGACGGCGGA TGAGCTGGAT TTTGCTGCGG CTGTGGAAAA TTTCATTGGG
AAACCCATTG AACGCAAAAA ACTGGACGGG TTCAACTACA CGTACACCGC CCTGTTGGAA
GACAAGCCCG TCAAATCCGT CCGCAAGCCC AAACCCGCAG GTCCCAAGCG CCGCAGGCGC
TAA
 
Protein sequence
MLFSELGLSE PVLKAVEKCG YEHPTPIQEQ AIPIILEGRD LIGASQTGTG KTAAFALPLL 
TRIQPIGKPQ ILVLEPTREL ADQVAESFAE YGEFTGLKVA LLYGGVGYGK QTEDLKKGAD
IVVATPGRLV DHFYRCTMRF GEVKALVLDE VDRMLDMGFL PIVRKIVNLC PWEGRQTLFF
SATMPPVIAG FAKWCLTDPA EVTIARREVA ATISHAFYPV ALDQRDELLL ALLKGTDFRS
VMIFTRTRKE ADAVCGMLKH HGYRGEVAVM HSDIPQKERM EALKGFKSGK YDILVATDVA
ARGIDISGVT HVINYRVPEN AEDYVHRIGR TGRAEASGDA FTIMTADELD FAAAVENFIG
KPIERKKLDG FNYTYTALLE DKPVKSVRKP KPAGPKRRRR