Gene Amuc_0785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0785 
Symbol 
ID6274397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp921168 
End bp922277 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content59% 
IMG OID642612836 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_001877400 
Protein GI187735288 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00169] 3-isopropylmalate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.537709 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAC ATCACCATCA TATTGCTGTC TTGGCCGGCG ACGGTATCGG TCCCGAGGTC 
ATGGCACAGG CTCTGAAGGT TCTGGATGCC GTAAGCGGCA AGTTTGGCTT TACCGTAAGC
CGCAAGGAGG CATTTGTAGG GGGTGCGGGC ATTGACCACT GCGGCAAGGC TCTTCCCGAA
GAAACCATTC GCGCCTGCGA AGAGGCGGAC GCCGTTCTGT TCGGCTCCGT GGGCGGTCCC
AAATGGGAGC ATCTCCCCGC CAATGAACAA CCGGAACGCG GAGCCCTGCT GCCTCTGAGA
AAACATTTCG GCTTGTATGC CAACCTGCGT CCGGGCGTAT GCCTTCCGGC CTTGACCCAT
GCCTCCCCCA TCAAGAATGA ACTGATTGAA GGCGGATTCG ATATTTTATG TGTCCGGGAA
TTGACTGGCG GCCTGTATTT CGGCCAGCCC CGTTTCCGCG AACGGGAGGG AGACGACGAA
GTCGTCGTGG ATACCATGCG CTACCATAAA AGCGAGATGG TGCGTATCGC CAAAGTGGCG
TTCGAGGCTG CGCGCGGCCG CCGTAAGCGC GTCACCAGCG TGGACAAGGC CAATGTGCTG
ACCAATTCCC TGCTGTGGCG CGAGACAATG ATTGAAGTTT CCAAGGATTA TCCCGACGTG
GAATTGCTGC ATATGTACGT GGACAATGCA GCCATGCAGC TGGTGCGCAA TCCCCGCCAG
TTCGACGTGC TGGTGACGGA AAACCTGTTC GGGGACATTC TTTCCGATGA AATGGCCATG
ATTTGCGGCT CCCTGGGCAT GCTGCCCAGC GCCAGCCTGT GCCAAGGCGC GCAGGACAAC
GGCCTGTTCT TCGGCCTTTA CGAACCCTCC GGAGGCTCCG CCCCGGATAT TGCCGGCAAG
GGCATTGCAA ACCCAATCGC CCAGATTCTT TCCCTTTCCA TGCTGTTGCG CTACTCCCTG
GGAGAAAAAA CAGCGGCAGA CGCCATTGAC TCCGCCGTTC GCCGCGTGAT TGACCAGGGC
TGCCGCACGG GCGACCTGGC TACGGGAGCT CCCGGAGAAA TCCGCGTGAA TACGGCGGAA
ATGGGTGACG CCATCATCGC CGCCCTGTAA
 
Protein sequence
MSEHHHHIAV LAGDGIGPEV MAQALKVLDA VSGKFGFTVS RKEAFVGGAG IDHCGKALPE 
ETIRACEEAD AVLFGSVGGP KWEHLPANEQ PERGALLPLR KHFGLYANLR PGVCLPALTH
ASPIKNELIE GGFDILCVRE LTGGLYFGQP RFREREGDDE VVVDTMRYHK SEMVRIAKVA
FEAARGRRKR VTSVDKANVL TNSLLWRETM IEVSKDYPDV ELLHMYVDNA AMQLVRNPRQ
FDVLVTENLF GDILSDEMAM ICGSLGMLPS ASLCQGAQDN GLFFGLYEPS GGSAPDIAGK
GIANPIAQIL SLSMLLRYSL GEKTAADAID SAVRRVIDQG CRTGDLATGA PGEIRVNTAE
MGDAIIAAL