Gene Amuc_2138 details

Gene Information

Locus tag: Amuc_2138
Symbol:
ID: 6273705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2605768
End bp: 2606859
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 59%
IMG OID: 642614200
Product: histidinol-phosphate aminotransferase
Protein accession: YP_001878728
Protein GI: 187736616
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
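
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention, although both agree with the values it reports. The function names are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(2605768, 2606859)
print(length, protein_length(length))  # 1092 363
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.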


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.302169
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.180028
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTATTG AGAGTTTCGC AAATCAGCAT GTGCTGGAGT TGGTGGCTTA TCAGCCGGGC 
AAGCCGATTG AGGAGACGGC CCGGGAACTG GGCCTGAATC CCCATGATAT TGTGAAACTG
GCGTCTAATG AAAATCCTCT GGGGCCGTCC CCGAAGGCGG TGGAGGCGAT TGCCCGTGCG
GCTGCCGGCG TGAACATTTA TCCGGACGGA GCGGCTTTCC GCCTGCGTTC CGCTATTGCG
GAGTTCTGCG GCGTGGAGTT CGGCCAGACG GTAGTGGGCA CGGGCAGCAG CGAAGTGATT
GAACTGATCT GCCATGCCCT GCTGAATCCC CGTGCGGAGG TGGTGGCCGC CAAACACGCC
TTTTCCATGT ACCCTATCAT GTCCAAGCTG TTTGGCGCCG CGTACGTGGA AGTGCCCAAC
AAGGAGGACT GGACGCATGA CCTGGACGGC TTCCTGGCCG CTATTACGGA GAATACGCGC
GTCGTGTTCA TTACGAATCC CACCAATCCC GTAGGCACCG TAGTGGGACA GCAGGAAATA
GACGATTTCA TGGCGAAGGT CCCGGAACAT GTGCTGGTGG TCTTTGACGA GGCGTACCGG
GAGTTTTCCG ACAATCCTCC GGATACCCTC AAATTTGTGC GAGAAGGCCG CAACGTGGTT
GTTCTGCGCA CCTTCTCCAA GGCTTACGGC CTGGCGGGGC TGCGCGTGGG CTACGGCATT
GCGCCGGAAC CGGTTTGCAG CATGCTGCAC AAGGCGCGTG CTCCGTTCAA CCTGCATGTT
CTGGCCCAGG AGGCCGCCCT GGCCGCCCTG GAGGACCGGG AGCATGTGCG CCGCACTGTG
GAGAATAATA AGGAAGGCAT GCGTTTTTAT GAGCAGGCTT TCCGGGAAAT GGGCCTGGAA
TGGATTCCCA GCCAGGGCAA CTTTATCCTG GTGAAAGTGG GCCGGGGCAA GCAGGTGTTC
CAGGATATGC TTGCCAGGGG AGTCATCGTC CGCGCGCAGG ACGGTTACGG CCTGCCGGAA
TGGATACGCA TCAGCATAGG CACTCCTGCG GAGAACGCCC GCTGCATTGA GGTATTGAAA
GAGGTTCTTT AA
 
Protein sequence
MSIESFANQH VLELVAYQPG KPIEETAREL GLNPHDIVKL ASNENPLGPS PKAVEAIARA 
AAGVNIYPDG AAFRLRSAIA EFCGVEFGQT VVGTGSSEVI ELICHALLNP RAEVVAAKHA
FSMYPIMSKL FGAAYVEVPN KEDWTHDLDG FLAAITENTR VVFITNPTNP VGTVVGQQEI
DDFMAKVPEH VLVVFDEAYR EFSDNPPDTL KFVREGRNVV VLRTFSKAYG LAGLRVGYGI
APEPVCSMLH KARAPFNLHV LAQEAALAAL EDREHVRRTV ENNKEGMRFY EQAFREMGLE
WIPSQGNFIL VKVGRGKQVF QDMLARGVIV RAQDGYGLPE WIRISIGTPA ENARCIEVLK
EVL
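
The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind behind a GC content field; the helper names are hypothetical, and the short input string is a toy example rather than the gene itself.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input laid out like the record: blocks separated by spaces and newlines.
toy = "ATGC GGCC\nTTAA"
seq = clean_sequence(toy)
print(seq, len(seq), gc_fraction(seq))  # ATGCGGCCTTAA 12 0.5
```

To apply this to the record, paste the lines under "Gene sequence" into `clean_sequence` and compare the length and GC fraction with the Gene Length and GC content fields.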