Gene Amuc_0803 details

Gene Information

Locus tag: Amuc_0803
Symbol:
ID: 6274374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 942092
End bp: 944230
Gene Length: 2139 bp
Protein Length: 712 aa
Translation table: 11
GC content: 55%
IMG OID: 642612853
Product: coagulation factor 5/8 type domain protein
Protein accession: YP_001877417
Protein GI: 187735305
COG category:
COG ID:
TIGRFAM ID:
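
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS includes its terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


# Values taken from the record above.
START_BP, END_BP = 942092, 944230

length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "2139 712"
```

Under these assumptions the result agrees with the listed gene length of 2139 bp and protein length of 712 aa.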


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.969227
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTATGC GCTGTCTTTT TTTTCCTCTG TGTTCCGCTA TTGCCTTAAC GTCTTTTGTC 
CAAGCCCAGT CCCTGGAGAG GCATCCCCTG TATGAACCCG CCCTTGTTTC CGTAGGAACT
CCGGGTGATG CAGGTAATTT ATTTGCCGGA GCCAAAGTAA CCGCTTCCGG TCATTACGGC
AGTGACCGTC CGGAACTGGC CGTGGACGGG CAGACGAATA ACGCCGGCAA ATACTGGGGT
TGCGAGGGCA TTCCCGTCTG GCTCCAGATA GATATGGGAA AGCCCCGGAC GCTTTCCTCC
CTGCATGTAT GGCCCTATTG GGAGGGGGGG CGCATTTACA AATATAAAAT AGAAGGTTCC
GAAGACGGAA AGAACTGGAA AATGTTGGCG GATCAGTCTT CCAACAGTAT TGCCGCCACT
CCGGAAGGCG TTCCCTTCAA ATTCAATCCG CAGACGGTCC GTTACGTTAA AATCACTTTT
TTGGGCAATA GTGCCGGCAA TGAGAAGGGG GGGCATCTGG TGGAAATCAA AGGGTACGGA
CCCGATGCCG CCCTGAGCCT GCAGGCTGCA GCGGTGAAGG ATTATGACCG TATTCCTTAC
AATGGCGCTC CCGGGCAGGA AATGCTTCAG GATGCCGTGC GTCTGTCCGG CTGGAGAGGC
GAACGCGCCG GCGGGCAGAT TGCCGTCTGG TCTTCCCAGG CGCAGCCCCA GTTATCGGCC
TCCTGCGCCG GGGTAAAAAA CGCTGCCGGA CAGATTATCC CTGTCCGGAC TGCCATGATA
CGCTACACCA GGGGGGGCAA CAGGCTGATT TCAGATATCA TCGGCAGCGA AAACAGCTGC
GATTTGCAAG CCGGAGGCGT GCGTCCGGTA TGGGTGGAGG TCAATATCCC CCCGTCTGCC
AAACCCGGCG TGTACAAAGG AAAAGTGGTA GTTTCCGCAG AAAGCGGCTC TCCCGTCAGT
GTGCCGGTAA TTCTGGAGGT GGCGCCGGAA TCCCTCCCTG CTCCCGCGCA TTGGCAAGTT
CATCTGGACT TGTGGCAGCA TCCGCAGGCC GTGGCCCGCT GGCATGATGT GGAACCGTGG
TCTCCGGAGC ATTTCGCACT GATGAAGCCG GTGATGAAAA GGCTGGCGGA TGCCGGGCAA
AAGGCCATTA CATGCTCCCT GATTGATGAA GCCTGGAATG CCCAGACTTA TGACTGGTTC
CCGCCCATGA TTGAATGGGT CAAGGGCAAA AACGGAACCA TGCGCTGGAA TTACGCCAAT
TTTGACAAGT GGGTTTCCTT CATGATGAAC GAGGTGGGCG TCAAGGGGCA GATATCCTGC
TATACCATGA TTCCCTGGAA CATGAAAATC CGTTATTTGG ATGAGGCGAC CGGAAAGTAC
AAATTTCTGG ATCTTAAGCC GAACGATCCC TCCTATGAAG CTATCTGGGG GCCTTTCCTG
ACGGATTTTC GCAAGCACGT CAAGAGCAAG GGCTGGCTGG GCAAAACCTG CATCGGGCTG
GATGAACGGC CGGACGCTAT GGTCAGGGCG GCCAAGAATG TACTGGACAA GTATGCCCCG
GAATTCAAAA TCGTTTCCGC CGTCAATCGG CCTACGGCCA TGACCCGGGA CGTTTATGAC
GTCTCTCCTG TAATTGACCA TGCAGGCACG GTCACGGGCG ATCTGCTGGC GCAGCGCAAA
AAGGAGGGAA AAAAGACGAC GTTCTATGTC TGTGTCCATC CCAAAAAGCC CAACACCTTC
ACTATTTCTC CGCTGGCGGA GGCGGAATGG CTTCCCCTCT TTGCCGCCGC CAATCATTTG
GACGGCTTTT TGAGATGGGC TTATAATTCC TGGAACCGCA ATCCGTTTGA AAAGACGGAT
TTCGGGAACT GGCCCGCGGG AGACTGCTAC CTTGTTTATC CCGGCAATCT CAGTTCCCTG
CGGTTTGAAA AACTCCGGGA CGGACTGGAG GAATTTGAAA AGGTCAATAT CCTGCGCGCC
CGCGCCGCAA AAAATCCTAA GGCGAAAGCT GCCGTAGCCC GCATGGATGA AGAGCTTTCC
AAGCTCTTTA CCGTGGAAAA AAGCCGCGGG GATTCCCATG AGGAAGACGT GTGGAAAGCC
CGCGAAATTA TCCGTAAAAC GACGGAAATT TCCCGCTGA
 
Protein sequence
MRMRCLFFPL CSAIALTSFV QAQSLERHPL YEPALVSVGT PGDAGNLFAG AKVTASGHYG 
SDRPELAVDG QTNNAGKYWG CEGIPVWLQI DMGKPRTLSS LHVWPYWEGG RIYKYKIEGS
EDGKNWKMLA DQSSNSIAAT PEGVPFKFNP QTVRYVKITF LGNSAGNEKG GHLVEIKGYG
PDAALSLQAA AVKDYDRIPY NGAPGQEMLQ DAVRLSGWRG ERAGGQIAVW SSQAQPQLSA
SCAGVKNAAG QIIPVRTAMI RYTRGGNRLI SDIIGSENSC DLQAGGVRPV WVEVNIPPSA
KPGVYKGKVV VSAESGSPVS VPVILEVAPE SLPAPAHWQV HLDLWQHPQA VARWHDVEPW
SPEHFALMKP VMKRLADAGQ KAITCSLIDE AWNAQTYDWF PPMIEWVKGK NGTMRWNYAN
FDKWVSFMMN EVGVKGQISC YTMIPWNMKI RYLDEATGKY KFLDLKPNDP SYEAIWGPFL
TDFRKHVKSK GWLGKTCIGL DERPDAMVRA AKNVLDKYAP EFKIVSAVNR PTAMTRDVYD
VSPVIDHAGT VTGDLLAQRK KEGKKTTFYV CVHPKKPNTF TISPLAEAEW LPLFAAANHL
DGFLRWAYNS WNRNPFEKTD FGNWPAGDCY LVYPGNLSSL RFEKLRDGLE EFEKVNILRA
RAAKNPKAKA AVARMDEELS KLFTVEKSRG DSHEEDVWKA REIIRKTTEI SR
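
The gene and protein sequences above are printed in blocks of 10 characters, so the spaces have to be removed before any computation. The following is a minimal sketch, not the database's own method, of how the nucleotide sequence could be translated with translation table 11 and how a GC percentage could be computed. The helper names are hypothetical. The amino acid string is the table 11 codon assignment in NCBI's T, C, A, G ordering, which is the same as the standard code for sense codons. Alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Codon assignments for translation table 11, ordered by BASES at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on ambiguous bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 nucleotides).
first_line = "ATGCGTATGC GCTGTCTTTT TTTTCCTCTG TGTTCCGCTA TTGCCTTAAC GTCTTTTGTC"
print(translate(first_line))
```

Run on the first 60 nucleotides, the sketch yields MRMRCLFFPLCSAIALTSFV, which matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would allow a comparison with the GC content field in the record.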