Gene Amuc_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2039 
Symbol 
ID6273708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2475865 
End bp2477499 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content60% 
IMG OID642614100 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_001878630 
Protein GI187736518 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000986258 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.000484057 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCACCA AAATCTTTCT CTATGATACT ACGTTGCGGG ACGGCGCGCA GAGTGAAGAC 
GTCAATTTGA GCGCGACGGA CAAGGTCCGC ATCGCCCGCC AGCTGGATTA TCTGGGCATG
GATTACATTG AGGGCGGCTG GCCCGGCGCC AATCCGGTGG AAACGGAATT TTTCAACGCC
ATGAGAGGTG TCGGACTAAG GAATGCAAAG CTGGCCGCCT TTGGAAGCAC GCACCATCCT
TCCCATACTC CGGAGACGGA CCCCACGCTG ACCGCCCTGA TATCCAGCGG TGCGCGGGTG
GCCGCGGTTT TTGGGAAATC CTGCCCCCGC CATGTGGAAG TGGCCCTGGG CATTTCCCGG
GAACGCAATC TGGAAATCAT CGGCAATTCC ATTTCCTTCC TTAAGAAAAA TATGGAAGAA
GCCTTTTTTG ACGCAGAACA CTTTTTTGAC GGCTTCAAGC GGGATCAGGA ATACGCCCTG
GCTGTGCTCC GGACAGCCTG GGAACATGGA GCGGACTGCC TGGTGCTGTG CGATACCAAC
GGAGGCACCA TGCCGGAGGA AATCAGCTCC ATTATCAGGA CGGTCAGGGA ACGCCTGCCG
CATGCCCTTC TGGGCATCCA CGCCCACAAT GACTGTGAAC TGGCCGTCGC CAACAGCCTG
GCCGCGGTAA ACAGCGGAGC CATCCAGGTG CAGGGGACCG TGAACGGCAT CGGGGAGCGT
TGCGGAAATG CCAATCTTTG TTCCGTCATC CCCAACCTCC AGGTGAAGAT GAAAGGCTTT
TCCTGCCTCA GCGGCGCCTC CCTGACGCGG CTCAAATCCA CTGCCGCCTT TGTCTCGGAA
GTATCCAATC TGGCGCCTTT CCGGCGGCAG CCCTTCGTGG GGAACGCCGC GTTCGCCCAT
AAGGGCGGAG TGCATGTCAG CGCGATCATG AAGGAAGCCG CTTTGTACGA GCACATCGAC
CCCTCCCTGG TGGGGAACGC CCAGCGCGTG CTGATGACGG AGCAGGGCGG CAGGAGCAAC
ATCCTTTCCC TGTCCCGCAC CCTGGGTTTT GAACTGGAAA AGGGAGACCC CCTTCTGGAC
GTGCTTTCCG CCGCCGTGAA GAAAAATGCC GCGCTGGGGT ATGATTACGT GGCCGCCCCG
GCCAGCGCGG AGCTGCTCTT CCTGCGGCAC ATGCCGGACA ATGCCTTGAA ACCGTATTTC
AACATCCTGC GCACTGTGGT GCTGACCTCA CGCCATGAAA TGGACCCGGA CATGATGGTG
GAAGCCTCCC TCAAGCTGGA TGTCCACGGC AATGTGGAGC ACACCGCCGC CGGGGGCTTT
GGCCCCGTGC ATGCGCTGGA CAGGGCTCTG CGCCGCGCCC TGACGCGCTG GTATCCGGAA
TTGGAGCAGA TGCACCTCAT CGACTACAAG GTGCGCGTGC TTTCCCCCAC CCGGACGAAC
ATTCCGGAGG CGGAGGATGA AAACGGAACC GGCTCCAATG TCCGCGTGCT TATTGAGTCC
TCGGACGGCG TCGCCACCTG GACCACTGTG GGCGTTTCCT ACAACATTAT TGAGGCCAGC
CTGGAAGCCC TGGCGGACGC CGTCACGTAC AAGCTCTACA AGACGGAACA GGCCAGATGG
CGTGCGGAAT GCTGA
 
Protein sequence
MSTKIFLYDT TLRDGAQSED VNLSATDKVR IARQLDYLGM DYIEGGWPGA NPVETEFFNA 
MRGVGLRNAK LAAFGSTHHP SHTPETDPTL TALISSGARV AAVFGKSCPR HVEVALGISR
ERNLEIIGNS ISFLKKNMEE AFFDAEHFFD GFKRDQEYAL AVLRTAWEHG ADCLVLCDTN
GGTMPEEISS IIRTVRERLP HALLGIHAHN DCELAVANSL AAVNSGAIQV QGTVNGIGER
CGNANLCSVI PNLQVKMKGF SCLSGASLTR LKSTAAFVSE VSNLAPFRRQ PFVGNAAFAH
KGGVHVSAIM KEAALYEHID PSLVGNAQRV LMTEQGGRSN ILSLSRTLGF ELEKGDPLLD
VLSAAVKKNA ALGYDYVAAP ASAELLFLRH MPDNALKPYF NILRTVVLTS RHEMDPDMMV
EASLKLDVHG NVEHTAAGGF GPVHALDRAL RRALTRWYPE LEQMHLIDYK VRVLSPTRTN
IPEAEDENGT GSNVRVLIES SDGVATWTTV GVSYNIIEAS LEALADAVTY KLYKTEQARW
RAEC