Gene Amuc_0119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0119 
Symbol 
ID6274915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp146653 
End bp147972 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content59% 
IMG OID642612164 
Producthypothetical protein 
Protein accessionYP_001876745 
Protein GI187734633 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCGTT CCTCCATCAT GCTTCTGGCA ACCATGCTGT GCGTTTCCTG CGTATCGCAC 
CGGCCTATCC AGGACAGCAG CTCCCCGCCC ATTGACGCGG CCAATCCCCT GGACGGCACC
CCCGTGGCGC TGGCGTGGAG CTCCGGAACG CAATTAATGA TGGGCGTAGA CACGGGAGCA
GTCCAGACAT CCCTGCTTTT CTCCCCGGCA GTGGAATCCA TCGGCGCACG CCTGCGCGGC
AGGGGCGCCA TGCGCACGGC CAACGTGCCC GTTTCCCTGA AGGATGACGG GGAACCCATT
TCCCGGAAAC AGGACGTAGT AATGGCAGAC CAGGCCCCGT ATGACGGTTT GCTGGGCTGG
GAATGCATCC GGAAATATGT GTGGAACATC AACTATCCCA AACGCTCCCA CCGTTTTTTC
AATAAACTTC CCTCCAGAAT AAAAAGCTGG AACAAGCTTT CCCTGATTCC CGGATCCGAC
TATCCGCAAA TCGCGGACAG GCACGGAAGG CGCATCATTC TGGACACGGG AGCCCCCCAC
GCCGTTTACA TCTCCAAAAA ACGCTGGAAT GCCATTAAGC AGGCCTACCC GGATGCGTTC
GTCAGCGTCT ATTCCGGCTA TAGCCCCGCC GCAGGCGGCT TTTACGCCCA CGAATGCATG
CATGTAAGCT CCTTCCAGCT CGGTCCACTG GAATTAAAAA ATATCCTGCT CTGTGAAAGC
TTCGCCAACC CGGAAGTGAT GGGCATCCCC GATGACATCG ACATCATCCT GGGCTACGGC
GCTCTGGCCG CACGCCAGTT CTGGCTGGAC GGCCCGGGGA ACGCCCTTTA TTTCAGCTCC
ACCAGCCACC GGATGCCCGC CCCCGCCTCC TTCAACCTGA TGGGAGGCAC CTTTATCCAG
GACAGCAACG GGAACGGCCC CATGAAAGCT TACGTGGCAG AGTGGTCTCC CGCATGGGAC
GCCGGCCTCA GGACGGGAGA TGTGCTTATT TCCATCAATG GAAGAAAGAA TCCCTATCCG
GACCTCGTAG AATATGTTAC CACCCAGCGG GGGGCTCAGG CCAGCGTGGT GGTCCAGCGC
AGGAACAGGC TGGTGCGCAT CCAATGGGAA GTTCCGGCCG CGCCCCCTGC CGGGGATTAT
TACCCCACGC CCCAGGCCAT TACGGAACAG GAATTCGAAA ACCACGTCAG GCAGCAGGAA
AAAAAAGAAC AGACCCAGCC CTCCGCAGAC GGCCAGCAGC CTCCGGCTAC GGCCGGAGAA
ACTCCGGATG AAGCCTCTCC CGCAGCTGAC GGGAAAACGG ACAAGGCCTC CGCTGCCTGA
 
Protein sequence
MFRSSIMLLA TMLCVSCVSH RPIQDSSSPP IDAANPLDGT PVALAWSSGT QLMMGVDTGA 
VQTSLLFSPA VESIGARLRG RGAMRTANVP VSLKDDGEPI SRKQDVVMAD QAPYDGLLGW
ECIRKYVWNI NYPKRSHRFF NKLPSRIKSW NKLSLIPGSD YPQIADRHGR RIILDTGAPH
AVYISKKRWN AIKQAYPDAF VSVYSGYSPA AGGFYAHECM HVSSFQLGPL ELKNILLCES
FANPEVMGIP DDIDIILGYG ALAARQFWLD GPGNALYFSS TSHRMPAPAS FNLMGGTFIQ
DSNGNGPMKA YVAEWSPAWD AGLRTGDVLI SINGRKNPYP DLVEYVTTQR GAQASVVVQR
RNRLVRIQWE VPAAPPAGDY YPTPQAITEQ EFENHVRQQE KKEQTQPSAD GQQPPATAGE
TPDEASPAAD GKTDKASAA