Gene Amuc_1417 details

Gene Information

Locus tag: Amuc_1417
Symbol:
ID: 6275767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1695420
End bp: 1696445
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 58%
IMG OID: 642613475
Product: glyceraldehyde-3-phosphate dehydrogenase, type I
Protein accession: YP_001878021
Protein GI: 187735909
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase
TIGRFAM ID: [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I
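
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; both assumptions are consistent with the values listed here but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record
length = gene_length(1695420, 1696445)
print(length, protein_length(length))
```

With the values from this record, the sketch yields 1026 bp and 341 aa, matching the listed gene and protein lengths.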


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.64657
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value: 0.617854
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAAT ACGCTATTAA CGGTTTTGGA CGCATTGGTC GCAACGTACT GCGCGCCATG 
TCCAAGGAAG AACGCAACAA GGTTGTTGCC ATCAATGACC TGACTCCTAT CGAAACGATC
GCCCACCTGC TCAAGTATGA CTCCACGCAG GGCAAGTTTG ACGGTGAAAT TTCCATCGAG
GGTGATTATC TGGTCGTTGA CGGTCACAAG ATCCTCATCA CCGTGGAACG TGATCCCGCC
AACCTTCCCT GGAAGGATCT GGGCGTGGAC GTCGTTCTGG AATCCACCGG CCTGTTCACC
AAGCGCGACG CCGCCAAGAA GCACCTTGAC GCCGGCGCCA AGAAGGTTCT TATTTCCGCT
CCCTCCCCGG ATCCGGACCT GACTTTCGTT CTGGGCATCA ACGACAGCGA ATACGATCCT
GCCAAGCACG ATATCGTTTC CAACGCTTCC TGCACCACCA ACTGCCTTGC TCCGATGGTG
AAGGTGCTGG ACGACAAGTT CGGCGTTGAA AAGGGCATGA TGAGCACGAT TCACTCCTAC
ACGAACGACC AGCGCATTCT GGACCTTCCG CACAAGGATC CCCGCCGTGC CCGCGCCGCC
GCGATCAACA TCATTCCGAC GACCACCGGC GCCGCCAAGG CCATTGGTGA AGTAATGCCG
AACCTGAAGG GTTCCCTGAA CGGCGCTTCC TTCCGCGTTC CGACTCCGAC CGGTTCCCTG
ACCGACTTTG TGGCCGTGCT CAAGAAGGAT GTGACCGTGG AAGAAGTAAA CGCCGCCATG
AAGGAAGCCG CTGAAGGCCC GCTGAAGGGC ATTCTGGCTT ACTCCGAAGA AGCGCTCGTT
CTTCAGGACA TCGTTTCCGA CCCCCACTCC TGCATCTTTG ACTCCGGCTT CACGTATGTG
GTCGGCGGCA ACCTGGTGAA GGTCTGCGGC TGGTACGACA ACGAATGGGG TTACTCCAAC
CGCGCCGCCC AGGCCATGAA GAAGCTGGGC GACAGCCTGG GCTGCGGATG CTCCTGCGGC
AAGTAA
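
The gene sequence is printed in blocks of 10 bases, so it needs to be joined before any computation. The following is a minimal Python sketch of how the sequence could be cleaned and its GC fraction computed; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the exact method the database used to derive its GC content field is not given in the record.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw):
    """Fraction of G and C bases in a sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, as printed in the record
first_line = "ATGGCCAAAT ACGCTATTAA CGGTTTTGGA CGCATTGGTC GCAACGTACT GCGCGCCATG"
print(len(clean_sequence(first_line)), round(gc_fraction(first_line), 2))
```

Applied to the full sequence, the result can be compared with the GC content field in the Gene Information section.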
 
Protein sequence
MAKYAINGFG RIGRNVLRAM SKEERNKVVA INDLTPIETI AHLLKYDSTQ GKFDGEISIE 
GDYLVVDGHK ILITVERDPA NLPWKDLGVD VVLESTGLFT KRDAAKKHLD AGAKKVLISA
PSPDPDLTFV LGINDSEYDP AKHDIVSNAS CTTNCLAPMV KVLDDKFGVE KGMMSTIHSY
TNDQRILDLP HKDPRRARAA AINIIPTTTG AAKAIGEVMP NLKGSLNGAS FRVPTPTGSL
TDFVAVLKKD VTVEEVNAAM KEAAEGPLKG ILAYSEEALV LQDIVSDPHS CIFDSGFTYV
VGGNLVKVCG WYDNEWGYSN RAAQAMKKLG DSLGCGCSCG K
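
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal Python sketch, assuming the standard NCBI codon ordering (bases in the order T, C, A, G) and the amino-acid assignments of translation table 11, which are the same as those of the standard code; the two tables differ in which codons may act as start codons, and this sketch does not model that. The `translate` function is illustrative, not the database's own tool.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna):
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record
print(translate("ATGGCCAAAT ACGCTATTAA CGGTTTTGGA"))  # MAKYAINGFG
```

The first ten residues produced this way match the start of the protein sequence listed above, and the final six bases of the gene (AAGTAA) translate to K followed by a stop codon, matching the terminal K.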