Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_2039 |
Symbol | |
ID | 6273708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2475865 |
End bp | 2477499 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642614100 |
Product | putative alpha-isopropylmalate/homocitrate synthase family transferase |
Protein accession | YP_001878630 |
Protein GI | 187736518 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000986258 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.000484057 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCACCA AAATCTTTCT CTATGATACT ACGTTGCGGG ACGGCGCGCA GAGTGAAGAC GTCAATTTGA GCGCGACGGA CAAGGTCCGC ATCGCCCGCC AGCTGGATTA TCTGGGCATG GATTACATTG AGGGCGGCTG GCCCGGCGCC AATCCGGTGG AAACGGAATT TTTCAACGCC ATGAGAGGTG TCGGACTAAG GAATGCAAAG CTGGCCGCCT TTGGAAGCAC GCACCATCCT TCCCATACTC CGGAGACGGA CCCCACGCTG ACCGCCCTGA TATCCAGCGG TGCGCGGGTG GCCGCGGTTT TTGGGAAATC CTGCCCCCGC CATGTGGAAG TGGCCCTGGG CATTTCCCGG GAACGCAATC TGGAAATCAT CGGCAATTCC ATTTCCTTCC TTAAGAAAAA TATGGAAGAA GCCTTTTTTG ACGCAGAACA CTTTTTTGAC GGCTTCAAGC GGGATCAGGA ATACGCCCTG GCTGTGCTCC GGACAGCCTG GGAACATGGA GCGGACTGCC TGGTGCTGTG CGATACCAAC GGAGGCACCA TGCCGGAGGA AATCAGCTCC ATTATCAGGA CGGTCAGGGA ACGCCTGCCG CATGCCCTTC TGGGCATCCA CGCCCACAAT GACTGTGAAC TGGCCGTCGC CAACAGCCTG GCCGCGGTAA ACAGCGGAGC CATCCAGGTG CAGGGGACCG TGAACGGCAT CGGGGAGCGT TGCGGAAATG CCAATCTTTG TTCCGTCATC CCCAACCTCC AGGTGAAGAT GAAAGGCTTT TCCTGCCTCA GCGGCGCCTC CCTGACGCGG CTCAAATCCA CTGCCGCCTT TGTCTCGGAA GTATCCAATC TGGCGCCTTT CCGGCGGCAG CCCTTCGTGG GGAACGCCGC GTTCGCCCAT AAGGGCGGAG TGCATGTCAG CGCGATCATG AAGGAAGCCG CTTTGTACGA GCACATCGAC CCCTCCCTGG TGGGGAACGC CCAGCGCGTG CTGATGACGG AGCAGGGCGG CAGGAGCAAC ATCCTTTCCC TGTCCCGCAC CCTGGGTTTT GAACTGGAAA AGGGAGACCC CCTTCTGGAC GTGCTTTCCG CCGCCGTGAA GAAAAATGCC GCGCTGGGGT ATGATTACGT GGCCGCCCCG GCCAGCGCGG AGCTGCTCTT CCTGCGGCAC ATGCCGGACA ATGCCTTGAA ACCGTATTTC AACATCCTGC GCACTGTGGT GCTGACCTCA CGCCATGAAA TGGACCCGGA CATGATGGTG GAAGCCTCCC TCAAGCTGGA TGTCCACGGC AATGTGGAGC ACACCGCCGC CGGGGGCTTT GGCCCCGTGC ATGCGCTGGA CAGGGCTCTG CGCCGCGCCC TGACGCGCTG GTATCCGGAA TTGGAGCAGA TGCACCTCAT CGACTACAAG GTGCGCGTGC TTTCCCCCAC CCGGACGAAC ATTCCGGAGG CGGAGGATGA AAACGGAACC GGCTCCAATG TCCGCGTGCT TATTGAGTCC TCGGACGGCG TCGCCACCTG GACCACTGTG GGCGTTTCCT ACAACATTAT TGAGGCCAGC CTGGAAGCCC TGGCGGACGC CGTCACGTAC AAGCTCTACA AGACGGAACA GGCCAGATGG CGTGCGGAAT GCTGA
|
Protein sequence | MSTKIFLYDT TLRDGAQSED VNLSATDKVR IARQLDYLGM DYIEGGWPGA NPVETEFFNA MRGVGLRNAK LAAFGSTHHP SHTPETDPTL TALISSGARV AAVFGKSCPR HVEVALGISR ERNLEIIGNS ISFLKKNMEE AFFDAEHFFD GFKRDQEYAL AVLRTAWEHG ADCLVLCDTN GGTMPEEISS IIRTVRERLP HALLGIHAHN DCELAVANSL AAVNSGAIQV QGTVNGIGER CGNANLCSVI PNLQVKMKGF SCLSGASLTR LKSTAAFVSE VSNLAPFRRQ PFVGNAAFAH KGGVHVSAIM KEAALYEHID PSLVGNAQRV LMTEQGGRSN ILSLSRTLGF ELEKGDPLLD VLSAAVKKNA ALGYDYVAAP ASAELLFLRH MPDNALKPYF NILRTVVLTS RHEMDPDMMV EASLKLDVHG NVEHTAAGGF GPVHALDRAL RRALTRWYPE LEQMHLIDYK VRVLSPTRTN IPEAEDENGT GSNVRVLIES SDGVATWTTV GVSYNIIEAS LEALADAVTY KLYKTEQARW RAEC
|
| |