Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1840 |
Symbol | |
ID | 6274794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2236215 |
End bp | 2238038 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642613903 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001878438 |
Protein GI | 187736326 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.170642 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATA CCTCCAATTT GTCTTATCCC GGTTCCCGCC GCATTTATGT TCCGGGCCGT CTGTACCCGG ATGTTCGGGT TCCCATGAGG GAAATCATTT TGGGCGATAC CTTGTTGCCC GACGGCACCG CCCACCCTAA TGACCCGGTG CGCGTGTACG ATTGCTCCGG CCCCTGGGGG GACGCTGCTT ATGAGGGAAC TGCGGAGGAG GGGCTTCCCT CCCTGCGTGC CGCCTGGATA CGGGCCCGCG GGGATGTGAA AGAGGATGTG GGGCACGAGC GCACCCTGCG TGCTGCGGGC AAGACGCCTG TGACACAGCG TTATTACGCC CAACAGGGCG TCATTACGCC GGAAATGGAA TTTGTCGCCA TCCGGGAAAA TCTGGGCAGG GAACAGGCTT TCAAGGCGAT ATACGACCGC TATCCAAATG CCAAGAGCCG CCCGGACGAA GCTGCGGAAG CCCTGGAAAC GCTCACCATG ATGCCGCGTC CGTCCGAATT GGAGGCGCAG GAGGGATTTG GGCCGTCCAG CATGGTGGCC CGCGACCGCC TGGATCACCA GCATGCCCCG GAACGCCGGA ACGGCTGCCG CATGCCTGCC TACTTTACGC CGGAATTCGT CAGGGATGAA ATTGCCTCCG GCCGTGCGCT GATTCCCGCC AACATCAATC ATCCGGAATG TGAGCCGATG GCTATCGGCC GCAATTTCCT GGTAAAAATC AACGCCAACA TAGGCAACTC CGCGCTGGGA TCCAGCATTG AGGAGGAGGT GGAAAAGCTG CGCTGGGCCA TTCACTGGGG GGCGGATACC GTTATGGACC TGTCCACCGG GAAGAATATC CACGCGACGA GGGAATGGAT TTTAAGAAAC TCCCCTGTCC CCATCGGTAC TGTTCCCATT TACCAGGCTC TGGAAAAGGT GGGAGGAAAG GTGGCTGACC TGAGTTGGGA GGTTTTTCGC GATACCCTGC TTGAACAGGC CCGCCAGGGC GTGGACTATG TGACGGTGCA CGCCGCCCTT CTGCTCAGGT TCGTGAACCA TACGGCGCGG CGCATGACGG GCATCGTTTC CCGCGGCGGT TCCATCATGG CGCAGTGGAG CATGATCCAC GAACAGGAAA ATTTCCTTTA TTCCCATTGG GATGAAATTT GTTCCATTCT GGCGGCTTAC GATATTGCCG TCTCCATCGG GGATGGCCTG CGGCCCGGTT CCGTGGCGGA CGCCAACGAC TTCGCCCAGC TGGCGGAGCT GGAAGTGCAG GGGGATTTGA CCATGCGCGC GTGGAAAGCG GGCGTGCAGG TCATGAATGA AGGCCCCGGC CACGTTCCCA TGCACCTTAT TAGGGAGAAC ATGAGCAAGC AGCTGGAATG GTGCATGGAA GCGCCCTTTT ACACGCTGGG GCCGCTGGTA ACGGACATCG CCCCCGGTTA CGACCATATT ACCGGAGCTA TCGGAGGGGC GATTATCGGC CAGCTTGGCT GCGCCATGCT TTGCTATGTA ACCAGAAAAG AGCATCTGGG GCTTCCTGAC CGGGAGGATG TGAGGGAAGG CGTGGTTGCC TACAAGCTGG CGGCCCACGC CGCAGACCTG GCGAAGGGGC ATCCATCCGC GCAATGGAGG GATAATGCTC TGGCCCAGGC CAGGTTTGAA TTCCGCTGGG AGGATCAATT CAACCTTTCC CTGGATCCGC AAAAGGCCCG TTCCTTCCAT GACCTGACGC TTCCCCATGC CAACGCCAAA AAAGCCCATT TCTGTTCCAT GTGCGGTCCG GACTTCTGCG CCATGCGCCT GAGCCAGGAT ATCCGCCGCC GCTCACAGCA ATAG
|
Protein sequence | MNDTSNLSYP GSRRIYVPGR LYPDVRVPMR EIILGDTLLP DGTAHPNDPV RVYDCSGPWG DAAYEGTAEE GLPSLRAAWI RARGDVKEDV GHERTLRAAG KTPVTQRYYA QQGVITPEME FVAIRENLGR EQAFKAIYDR YPNAKSRPDE AAEALETLTM MPRPSELEAQ EGFGPSSMVA RDRLDHQHAP ERRNGCRMPA YFTPEFVRDE IASGRALIPA NINHPECEPM AIGRNFLVKI NANIGNSALG SSIEEEVEKL RWAIHWGADT VMDLSTGKNI HATREWILRN SPVPIGTVPI YQALEKVGGK VADLSWEVFR DTLLEQARQG VDYVTVHAAL LLRFVNHTAR RMTGIVSRGG SIMAQWSMIH EQENFLYSHW DEICSILAAY DIAVSIGDGL RPGSVADAND FAQLAELEVQ GDLTMRAWKA GVQVMNEGPG HVPMHLIREN MSKQLEWCME APFYTLGPLV TDIAPGYDHI TGAIGGAIIG QLGCAMLCYV TRKEHLGLPD REDVREGVVA YKLAAHAADL AKGHPSAQWR DNALAQARFE FRWEDQFNLS LDPQKARSFH DLTLPHANAK KAHFCSMCGP DFCAMRLSQD IRRRSQQ
|
| |