Gene Amuc_1860 details

Gene Information

Locus tag: Amuc_1860
Symbol:
ID: 6275480
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2262706
End bp: 2264379
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 58%
IMG OID: 642613921
Product: Formate--tetrahydrofolate ligase
Protein accession: YP_001878455
Protein GI: 187736343
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
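
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2262706
end_bp = 2264379
gene_length_bp = 1674
protein_length_aa = 557

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp == gene_length_bp)                         # 1674 True
print(expected_protein_aa, expected_protein_aa == protein_length_aa)  # 557 True
```

Under these assumptions the reported gene length and protein length agree with the coordinates.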


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.458674
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value: 0.41641
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCATGA ACCTTCTTGC CAAAAGAGGG GGCTTCCGTC ATCATGTGCG CATGGTTACA 
CCTGCATTCT CCGACTGTAT CAACAAGCTG GGCGTCGATG AAAAAGACGT CATTCCCTTC
GGCCGCAACA AGGCCAAAAT TTCCCTGGAC GTATTGGACA AGCCGGCAAC CCCCGGCAAA
CTCATCCTGG TATCCGCCAT CACTCCCACC CCATCCGGAG AAGGCAAAAC CACCGTTTCC
ATCGGCTTGG CGCAGGGATT GCAGGCCATC GGCAAAAAAG TCTGCCTGGC GCTCCGGCAA
CCTTCCATGG GACCGGTGTT CGGCCGCAAG GGCGGAGCTA CCGGAGGCGG CAAAAGCTCC
CTGACGCCGG CAGAGGAAAT CAACATGCAT TTCACAGGGG ACTTCCACGC CATTACATCC
GCCCACAACC TCATCAGCGC CATCATTGAT AACGCCATGT TCTTCCACAC GCTGAACATT
GACGAACGCA AAGTCATATG GAAGCGGGTC ATGGACATGA ACGACCGTTC CCTGCGTTCC
ATCATCGTGG GGCTCAACAA GCAGGGGTTC CCCCGAGAAA CAGGCTTCGA TATCACCCCG
GCTTCGGAAA TCATGGCGTG CCTGTGCCTT GCCACTTCCT ACAAGGACAT GGAAGAACGC
ATCAACCGCA TCGTGATCGG ATTCACGACG GATGACAAGC CCGTATTCGC AAGGGAACTG
GGCATTACCG GTTCCGTCAT GTCCCTGCTG AAAGATGCCC TGATGCCCAA TCTGGTTCAA
TCCGTGGAAG GAGTGCCCTG TTTCCTGCAC GGCGGCCCGT TTGCCAACAT TGCCCACGGC
TGCAACTCCG TGCTGGCCAC CAGGATGGCC CTGCATTTTG GGGACTATGC CGTCACGGAA
GCCGGATTCG CGTTTGACCT GGGTGCAGAG AAATTCCTGG ATATCAAATG CCGCCAGTCC
GGACTGGATC CGGCGGCCAT TGTCATCGTA GCTACGGCAC GCGCGCTGAA AATGCACGGG
GGAACTGCCC TTGCGGATCT GAAAAACACG GATGTGGATG CGCTGAAAAA AGGCCTTGCC
AACCTGGATG CGCATCTGGA CGCTGCCGCC CACTACAAAC GCCCCGTTGT GGTCGCCGTC
AACAAATTCT TTGACGACTC CCGGGAAGAA CTGGACGCTA TCGTGAAACA CTGCGCGGAA
CGCGGTATTC CCTGCGCCAT TGCGGATATC TTCTCCCAGG GAGGAGAAGG AGGCAAAGAC
CTGGCCCAAA TGGTTGTGGA AGCTGCGGAC AGGAGTTCCG CCCCTTTCAA GCCCCTCTAT
GAATCCGCCC TGCCGGTGGA AGAAAAACTC AACATCATTG CCCGCAACAT TTACGGGGCG
GACGGTGTGG AATTGACAGC CGCCGCCAAA AAGAAGCTGG CCCAGTTTGA AGCCAGCCGC
CTGACGGACC TTCCCATCTG CATGGCAAAA ACCCAGAACT CCCTTTCCGA CAACGGACGC
CTCCGAGGAC GCCCCACCGG CTTCACCATC ACCGTGCGCG ACTTTGAAAT CGCCAACGGG
GCCGGCTTCC TGGTGGCCCT CTGCGGGGAA ATCATGCGCA TGCCCGCCCT TCCCGTCTCA
CCGAACGCCA TGCACATCTA CCTGGATGAC AAGGGCAACG TCCAGGGGCT CTGA
 
Protein sequence
MPMNLLAKRG GFRHHVRMVT PAFSDCINKL GVDEKDVIPF GRNKAKISLD VLDKPATPGK 
LILVSAITPT PSGEGKTTVS IGLAQGLQAI GKKVCLALRQ PSMGPVFGRK GGATGGGKSS
LTPAEEINMH FTGDFHAITS AHNLISAIID NAMFFHTLNI DERKVIWKRV MDMNDRSLRS
IIVGLNKQGF PRETGFDITP ASEIMACLCL ATSYKDMEER INRIVIGFTT DDKPVFAREL
GITGSVMSLL KDALMPNLVQ SVEGVPCFLH GGPFANIAHG CNSVLATRMA LHFGDYAVTE
AGFAFDLGAE KFLDIKCRQS GLDPAAIVIV ATARALKMHG GTALADLKNT DVDALKKGLA
NLDAHLDAAA HYKRPVVVAV NKFFDDSREE LDAIVKHCAE RGIPCAIADI FSQGGEGGKD
LAQMVVEAAD RSSAPFKPLY ESALPVEEKL NIIARNIYGA DGVELTAAAK KKLAQFEASR
LTDLPICMAK TQNSLSDNGR LRGRPTGFTI TVRDFEIANG AGFLVALCGE IMRMPALPVS
PNAMHIYLDD KGNVQGL
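
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the method used to produce this record, of how the blocks might be cleaned, how GC content could be computed, and how the gene sequence could be translated. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table uses the standard assignments, which NCBI translation table 11 shares; table 11 differs in its set of permitted start codons, which this sketch does not model.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments, which
# are the same in translation table 11; alternative start codons are ignored).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein.
gene_start = clean(
    "ATGCCCATGA ACCTTCTTGC CAAAAGAGGG GGCTTCCGTC ATCATGTGCG CATGGTTACA"
)
protein_start = clean("MPMNLLAKRG GFRHHVRMVT")

print(translate(gene_start))                   # MPMNLLAKRGGFRHHVRMVT
print(translate(gene_start) == protein_start)  # True
```

Passing the complete gene sequence through `clean` and `translate` can be compared against the listed protein sequence in the same way, and `gc_percent` can be compared against the GC content field.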