Gene Amuc_0937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0937 
Symbol 
ID6274228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1114739 
End bp1116295 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content61% 
IMG OID642612991 
Productmetal dependent phosphohydrolase 
Protein accessionYP_001877550 
Protein GI187735438 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCAAG GCGAGCAGGT TCAGGATTCG GAATTATCAA CTCAGGCGGT GATCTATCTG 
GGCCCGAGCT CCATGAGCCT GATGGTGGCT GAGGCTGTCC GGGACAGGAT TCGCCTGCTG
GATTTCCTTC AGCAGCCCGT TCCGATGGCG CGTGACATTT TCCGGTTCCA CCGCATTTCC
CGGCATACCA TGGACCGCTG CGTGCAAATC ATCGGCGACT ATCTGGAAAT TCTCAAGGAA
TACGGAGCCG GCAGCAAGCT TTCCGTCCGG TTCATGATTT CCAACATCAT TTCCGAGGCG
GATAATGTGG ACGTGTTCGT GAACCGCATG CACGTGGCCC ACGGCTTGCG GGGGCGCCGC
ATTGACGACG GCAAGATGAC GCGCCTCATT TACGTGAAGG TGCAGGAGGC TCTGGCCCAG
TATCCGGGAT TCAGCAAGAA AAAGGTGCTT GTGGTCCATA CGGGGCCGGG CAATACCCGC
GTGCTTCTGT TCCAGAAGGG GCGCATCGTG CGTTATTCCT GCTACAGGCT GGGAACGCAC
CGCACGGGGG AGGCCGTCGG GGAAATTGAG TACGGAGACG ATGTGGCGGA GCTTTCCATT
CTGCGGGAGC ACATGCGCGG GCAGGTGGAC CAGATTTGCC TGGATTACGG GGGCGTGAAG
GGCCTGGCGG GCCTTATCGT CATCGGCCAG GAAATGCAGC AGCTCCGGGA CCGCCTGGCC
CCCACGCCGG AAGGCAAGGT GGCGTGTTCC TCCCTGGCGG CGGAGGCGGA GCGGATGTCC
CGCACCACTC TGGAACAGCG CATGAATGTT TACGGTGCGG ATTTTGCCGG GGTGGACTCC
CTGCTGCCCG CCGTTTTGAT GACGGAAATG ATTGCCCGCA GCCTGAACCT GGATGACGTC
ATCATTCCCG CGAGCGGTTA TGACGAGGAG TTTTCAAGCA GCCTGATACG TGCGGAACAG
CATCCGGGGG ATCTGGAGGC GGAGGTTCTC CATTTCGCCG GGATTCTGGC GGACAGGTAC
AAGGCGGACA AAGGGCACCG CGAGCATGTG GCGCGCCTGT GCATGGAAAT GTTTGACCAG
CTTCAGGACC TGCACCGCCT TTCCGAACAT GACCGGCTGC TGCTGGAAGT GGCCTCCATT
CTGCATGAGG TTGGGTCTTT TATCAACCAG CAGAATCACC AGCTCCATTC CCAGTATATC
ATTCTCAACA GTGAAATCTT CGGCCTTTCC CGGGATGATG TGGAAACGAT CGCCCTGCTG
GCCCGCTACC ACCGGCATGA GGTTCCCGCC AATTCCGATC CCATGTACGG GGAGCTGGAA
TTGAGGGACC GCATGCGCGT AGCCAAGATG GCCGCCATCC TGCGCGTGGC GGATGCCCTG
GAACGCGGCC ATGCCCAGCG CGTGAACGGC GTCCGGGCGC ACATCCGCGG GCGCATGCTG
GAGCTGGAGC TTCAGGGCGT GCGTGAAACC GCCGTGGAAG ACCTGGCCCT GCGGCTGAAG
GGCGACCTGT TTGCGGATAT CTTCGGTTAT GACGTCGTGC TGGCGCCCCA GCGGTAG
 
Protein sequence
MIQGEQVQDS ELSTQAVIYL GPSSMSLMVA EAVRDRIRLL DFLQQPVPMA RDIFRFHRIS 
RHTMDRCVQI IGDYLEILKE YGAGSKLSVR FMISNIISEA DNVDVFVNRM HVAHGLRGRR
IDDGKMTRLI YVKVQEALAQ YPGFSKKKVL VVHTGPGNTR VLLFQKGRIV RYSCYRLGTH
RTGEAVGEIE YGDDVAELSI LREHMRGQVD QICLDYGGVK GLAGLIVIGQ EMQQLRDRLA
PTPEGKVACS SLAAEAERMS RTTLEQRMNV YGADFAGVDS LLPAVLMTEM IARSLNLDDV
IIPASGYDEE FSSSLIRAEQ HPGDLEAEVL HFAGILADRY KADKGHREHV ARLCMEMFDQ
LQDLHRLSEH DRLLLEVASI LHEVGSFINQ QNHQLHSQYI ILNSEIFGLS RDDVETIALL
ARYHRHEVPA NSDPMYGELE LRDRMRVAKM AAILRVADAL ERGHAQRVNG VRAHIRGRML
ELELQGVRET AVEDLALRLK GDLFADIFGY DVVLAPQR