Gene Amuc_2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2120 
Symbol 
ID6273746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2582566 
End bp2583468 
Gene Length903 bp 
Protein Length300 aa 
Translation table11 
GC content60% 
IMG OID642614182 
Producthypothetical protein 
Protein accessionYP_001878710 
Protein GI187736598 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.526996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.0698278 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATACC TGAATCACAA CCTGCTCAGC CGCCTGGGGA ACCTTCCCCT GGAAGCGCGC 
CACAGCATGA CGGGGAACGT ATCCGGCCGC CATAGAAGCG CCAGCCGCGG CTCCTCCGTG
GAATTTGCGG AATACCGCAA ATACGTGGCC GGAGACGATA CACGCCGTCT GGACTGGAAA
GCGTATGCAC GGTCGGACAG GTATTACATC AAGGAGTTTG AAGCGGATAC CAACCTGCGC
GCCTACATCG TCATGGATCT TTCCGGCTCC ATGAATTATC ATCCGGAGCA GGTGGAGACC
AAGTATATGC GCGCGTGCAG GCTGGCGGCC AACCTGGCTT ACATTGCCAT CCGGCAGGGA
GACGCGGTGG GACTGAGTTT TGCCCGGCAG ACGAAGGACG GCGCCGCGCT GCACATCCCC
GCCTCCCGCC GCCCCGCCCA CCTGAACGTG CTCATCAGCC AGATGGACAC GCATTCTCCG
CAGGGGGAGA CCGTCCTTCC CGATACCCTG CATGAACTGG CGGAACGCGT CGGCCGCCGC
GCCCTGGTGT TGATTTTCTC AGACCTGTTC ACAGATACGG CGGAGCTTAA AAACGCCCTC
CGCCACCTCC ATTTCCGCAA GCATGACATA GCCGTATTCC ATCTGGTGGA CCAGTTGGAA
ATAGATTTTG ACTTTGACCG CCCCATCCGC TTCGTGGACA TGGAAGGCGG CGGCTCCCTG
ATTACGGAGC CGGACCTTAT CGCGGACGAG TACCGGGCTA TCGTGGCCAG CTATCTGGAA
GAAACGCGCC GGATCTGCAC GGATATCAAT GCGGACTACC GCCTGGTCAG AACCGGAGAT
TCCCTGGAAG ACGTGCTGAC CGGCTTCCTG ATGGGACGTC AGAAAAAGAA GGCGGCCGGA
TAA
 
Protein sequence
MKYLNHNLLS RLGNLPLEAR HSMTGNVSGR HRSASRGSSV EFAEYRKYVA GDDTRRLDWK 
AYARSDRYYI KEFEADTNLR AYIVMDLSGS MNYHPEQVET KYMRACRLAA NLAYIAIRQG
DAVGLSFARQ TKDGAALHIP ASRRPAHLNV LISQMDTHSP QGETVLPDTL HELAERVGRR
ALVLIFSDLF TDTAELKNAL RHLHFRKHDI AVFHLVDQLE IDFDFDRPIR FVDMEGGGSL
ITEPDLIADE YRAIVASYLE ETRRICTDIN ADYRLVRTGD SLEDVLTGFL MGRQKKKAAG