Gene Amuc_1840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1840 
Symbol 
ID6274794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2236215 
End bp2238038 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content60% 
IMG OID642613903 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001878438 
Protein GI187736326 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.170642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGATA CCTCCAATTT GTCTTATCCC GGTTCCCGCC GCATTTATGT TCCGGGCCGT 
CTGTACCCGG ATGTTCGGGT TCCCATGAGG GAAATCATTT TGGGCGATAC CTTGTTGCCC
GACGGCACCG CCCACCCTAA TGACCCGGTG CGCGTGTACG ATTGCTCCGG CCCCTGGGGG
GACGCTGCTT ATGAGGGAAC TGCGGAGGAG GGGCTTCCCT CCCTGCGTGC CGCCTGGATA
CGGGCCCGCG GGGATGTGAA AGAGGATGTG GGGCACGAGC GCACCCTGCG TGCTGCGGGC
AAGACGCCTG TGACACAGCG TTATTACGCC CAACAGGGCG TCATTACGCC GGAAATGGAA
TTTGTCGCCA TCCGGGAAAA TCTGGGCAGG GAACAGGCTT TCAAGGCGAT ATACGACCGC
TATCCAAATG CCAAGAGCCG CCCGGACGAA GCTGCGGAAG CCCTGGAAAC GCTCACCATG
ATGCCGCGTC CGTCCGAATT GGAGGCGCAG GAGGGATTTG GGCCGTCCAG CATGGTGGCC
CGCGACCGCC TGGATCACCA GCATGCCCCG GAACGCCGGA ACGGCTGCCG CATGCCTGCC
TACTTTACGC CGGAATTCGT CAGGGATGAA ATTGCCTCCG GCCGTGCGCT GATTCCCGCC
AACATCAATC ATCCGGAATG TGAGCCGATG GCTATCGGCC GCAATTTCCT GGTAAAAATC
AACGCCAACA TAGGCAACTC CGCGCTGGGA TCCAGCATTG AGGAGGAGGT GGAAAAGCTG
CGCTGGGCCA TTCACTGGGG GGCGGATACC GTTATGGACC TGTCCACCGG GAAGAATATC
CACGCGACGA GGGAATGGAT TTTAAGAAAC TCCCCTGTCC CCATCGGTAC TGTTCCCATT
TACCAGGCTC TGGAAAAGGT GGGAGGAAAG GTGGCTGACC TGAGTTGGGA GGTTTTTCGC
GATACCCTGC TTGAACAGGC CCGCCAGGGC GTGGACTATG TGACGGTGCA CGCCGCCCTT
CTGCTCAGGT TCGTGAACCA TACGGCGCGG CGCATGACGG GCATCGTTTC CCGCGGCGGT
TCCATCATGG CGCAGTGGAG CATGATCCAC GAACAGGAAA ATTTCCTTTA TTCCCATTGG
GATGAAATTT GTTCCATTCT GGCGGCTTAC GATATTGCCG TCTCCATCGG GGATGGCCTG
CGGCCCGGTT CCGTGGCGGA CGCCAACGAC TTCGCCCAGC TGGCGGAGCT GGAAGTGCAG
GGGGATTTGA CCATGCGCGC GTGGAAAGCG GGCGTGCAGG TCATGAATGA AGGCCCCGGC
CACGTTCCCA TGCACCTTAT TAGGGAGAAC ATGAGCAAGC AGCTGGAATG GTGCATGGAA
GCGCCCTTTT ACACGCTGGG GCCGCTGGTA ACGGACATCG CCCCCGGTTA CGACCATATT
ACCGGAGCTA TCGGAGGGGC GATTATCGGC CAGCTTGGCT GCGCCATGCT TTGCTATGTA
ACCAGAAAAG AGCATCTGGG GCTTCCTGAC CGGGAGGATG TGAGGGAAGG CGTGGTTGCC
TACAAGCTGG CGGCCCACGC CGCAGACCTG GCGAAGGGGC ATCCATCCGC GCAATGGAGG
GATAATGCTC TGGCCCAGGC CAGGTTTGAA TTCCGCTGGG AGGATCAATT CAACCTTTCC
CTGGATCCGC AAAAGGCCCG TTCCTTCCAT GACCTGACGC TTCCCCATGC CAACGCCAAA
AAAGCCCATT TCTGTTCCAT GTGCGGTCCG GACTTCTGCG CCATGCGCCT GAGCCAGGAT
ATCCGCCGCC GCTCACAGCA ATAG
 
Protein sequence
MNDTSNLSYP GSRRIYVPGR LYPDVRVPMR EIILGDTLLP DGTAHPNDPV RVYDCSGPWG 
DAAYEGTAEE GLPSLRAAWI RARGDVKEDV GHERTLRAAG KTPVTQRYYA QQGVITPEME
FVAIRENLGR EQAFKAIYDR YPNAKSRPDE AAEALETLTM MPRPSELEAQ EGFGPSSMVA
RDRLDHQHAP ERRNGCRMPA YFTPEFVRDE IASGRALIPA NINHPECEPM AIGRNFLVKI
NANIGNSALG SSIEEEVEKL RWAIHWGADT VMDLSTGKNI HATREWILRN SPVPIGTVPI
YQALEKVGGK VADLSWEVFR DTLLEQARQG VDYVTVHAAL LLRFVNHTAR RMTGIVSRGG
SIMAQWSMIH EQENFLYSHW DEICSILAAY DIAVSIGDGL RPGSVADAND FAQLAELEVQ
GDLTMRAWKA GVQVMNEGPG HVPMHLIREN MSKQLEWCME APFYTLGPLV TDIAPGYDHI
TGAIGGAIIG QLGCAMLCYV TRKEHLGLPD REDVREGVVA YKLAAHAADL AKGHPSAQWR
DNALAQARFE FRWEDQFNLS LDPQKARSFH DLTLPHANAK KAHFCSMCGP DFCAMRLSQD
IRRRSQQ