Gene Amuc_1098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1098 
Symbol 
ID6274003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1310567 
End bp1313290 
Gene Length2724 bp 
Protein Length907 aa 
Translation table11 
GC content59% 
IMG OID642613149 
Producttype II and III secretion system protein 
Protein accessionYP_001877705 
Protein GI187735593 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1450] Type II secretory pathway, component PulD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0580477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.000681519 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACCACG CACCACTTTA TCAACCGAAG CGTTCCCTGA TCGCGCTGAT GGCCATTGCT 
GCCTCCTGCC CGTTTGCGCA GGCTGGTGAT GGCGGCGCCG TCGGAACCTC CAGTGCCTTG
GGATCGTCCT ACGCCGGACC GGGCAGTTAC CAATACCAGT CCAGTGCAGC GCGCACAGCC
ATGGCCCGCC GCGAAGCGCA AACCCAGGAA GCCATGCAGC TCCTTGCGGA AGGCCGCAAC
CTGTACCGCG AAGGCAAGTA CAAGGAAGCC CTGGACAAAT ACAACGCCGC TTACAACATG
CTGCCTTCCG CGCCGATCAA CGACCAGCGC AAGGAAGCTA TCGCCAACCA TATTGGCGAC
GCCAGCATCG CCGTTGCTCA GGAATACATC AAAGTGGGGC GCTATGACGA AGCCGACAAG
CTTCTGCAGG ATGCCATCAA GCTCAATCCC CGGAGCGCCA AGCTTGCCAA GCAAACGCTC
GAATACATGA AGGACCCGAT TCGCACGAAT CCGGCGCTCA CCCCCGAACA CGTGAAAAAC
GTGGAAAAGG TGAACACCCT CCTTCACATG GCCTATGGTT ACTATGACCT CGGCGACTAC
GACAAAGCCA TCGCCGAATT CAACAAGGTT CTCTCCATCG ACCCGTACAA CGTGGCGGCC
CGCCGCGGAC AGGAAACGGT CAACCGCCGC AGAATGGCTT ACTATGCAGC CGCTTACGAC
GAAACCCGCA GCACCATGCT GGCGGAAGTG GACAAGATGT GGGAACGCCC CATCCCGATG
GAAGTCCCGA CCGGAGCCGA CGGCACTGAC AACGCACCGA TCACGGACAT CAACGGCGCC
ACGGCCAATC TGATGAAGCT CAAGAGCATC ATCATCCCCT CCGTCTCTTT TGAAGACACC
ACCGTGGAAG ACGCCATTGA CTATCTGCGC AAAAAGTCCA TTGAACTGGA CCGCACGGTA
GGCCCGAACG GTGAACGCGG CATCAACTTT GTCATCAATG ATTCCCAGCC CGCTGCCGTC
GCCCCCGCCG TTCCCGCAAC TGATGAAGAC GGCTTTGGCG AAGAAACTGC GGAAGTCACG
GAAGCTGCTC CGGCAGCAGC CCCGCAGGAA AGCATCCGCA CCCGCAAAAT CGGCCAGTTG
AAGCTGACTA ATGTTCCCAT GCTGGAAGTG CTCCGTTTCA TCTGCAGCAA TGCCGGCCTG
CGCCAGAAGG TGGAAGACTA TGCAGTGACC ATCCTTCCTG CCGGCGGCAA TGACGTGGAT
CTGTACCAGC GCACCTTCTC CGTGCCCCCG GGCTTCCAGT CCGCTCTCCG CACCACCGTC
GGCGACGGCG GCGGCGAAGT CAGTGACGAC CCCTTTGGCG GCGGTGGCGA AAGCTCCTCC
GGCCTTAAGC CCATGCCCTC CATCCGCAGC CTGCTGCAAA AGAGCGGCAT CAGCTTCCCG
GAAGGCGCCA CGGCATTCCT TGTCAACGGC AATTCCTCCC TGGTCGTCCG CAACACTTCC
GGCAACCTGG ACCTGATCGA ACAGCTCATT GAAAACACCC GCGGCGAATC CCAGCAGGTG
CGCATCATGA CCAAGTTCGT GGAAGTAACC CAGGAAAACA CGGAAGAACT CGGCTTTGAC
TGGATTGTCA CCCCGTTCTC CGTAAGCAAT GACCGCAGCA CCTTCCTGGG CGGCGGCACG
AACTACGGCA CCGGTTCCAC TTCCGACAGC TTTACCCAGT CTCCCGGCGG CGTGACCGGC
TGGCCTGTAA ACAGCGGCAG CGACACCATC AACGGCCTCG TTACCGGCGG CAACCGCACG
GGTGACTACG CCATCACCAA GAACTCCGTG GACAATCTGC TGAACAGCAC CAACCGCTCT
GAAGCCTCCC AGAAAAACGC CGCTCCCGGC ATCATGTCCC TGACGGGCAT TTATGACGAA
GGCTCCTTCC AGATGCTGAT GCGCGGCCTG TCCCAGAAAA AAGGCTCTGA CGTCCTCACC
GCCCCCAGCG TGACCGCCAA GTCCGGTGAA ACCGCCAAGA TTGAAATCAT CCGCGAATTC
TGGTATCCCA CCGAATACGA ACCGCCGGAA CTCCCCAACT CCGTAGGCAA CAGCGGCTAT
AACAACGGTT ACGGTTATGG GAACAACGGT AACATCGTGG ACGGCCTGCT GGGCAATCAG
ATCCAGCCCC AGATATCCAG CTTCCCCGTC ACTCCCGCCA CCCCCGGCGT GTTTGAAATG
AAGCCCGTCG GCGTAACTCT GGAAGTGGTG CCTACCATTG GCGACAACAA GTACATCATC
GACCTGAACT TCAAGCCCAG CATCGTGGAA TTTGAAGGCT TCGTGAACTA CGGCAGCCCG
ATCCAGTCCA CCGGCGTTGG TTCCGACGGC AAGCCGATGT CCCTGACGCT GACGGAAAAC
CGCATCGAGC AGCCGATCTT CTCCAAGAGG TCCGTTGAAA CGTCCCTGTT CATCTACGAC
GGCCATACCG TGGCAATCGG TGGTTTGATC ACGGAAAACG TGCAGACGGT GGAAGACAAA
GTGCCGATCT TCGGTGACCT GCCTCTCATC GGCCGCTTCT TCCGCAGCAA CTCCGACAAC
CACATCAAGA AGAATCTGAT GATCTTCGTA ACGGGACAGA TCATTGACGC CACGGGCCAG
CCCGTACGCG GCAATGCCCT TCCCACTGCG GCCGCTCCGG AAAGCGCCCT TCCCGCCTCC
GAAGGCCTGC TGCCTCCCAT GTAG
 
Protein sequence
MDHAPLYQPK RSLIALMAIA ASCPFAQAGD GGAVGTSSAL GSSYAGPGSY QYQSSAARTA 
MARREAQTQE AMQLLAEGRN LYREGKYKEA LDKYNAAYNM LPSAPINDQR KEAIANHIGD
ASIAVAQEYI KVGRYDEADK LLQDAIKLNP RSAKLAKQTL EYMKDPIRTN PALTPEHVKN
VEKVNTLLHM AYGYYDLGDY DKAIAEFNKV LSIDPYNVAA RRGQETVNRR RMAYYAAAYD
ETRSTMLAEV DKMWERPIPM EVPTGADGTD NAPITDINGA TANLMKLKSI IIPSVSFEDT
TVEDAIDYLR KKSIELDRTV GPNGERGINF VINDSQPAAV APAVPATDED GFGEETAEVT
EAAPAAAPQE SIRTRKIGQL KLTNVPMLEV LRFICSNAGL RQKVEDYAVT ILPAGGNDVD
LYQRTFSVPP GFQSALRTTV GDGGGEVSDD PFGGGGESSS GLKPMPSIRS LLQKSGISFP
EGATAFLVNG NSSLVVRNTS GNLDLIEQLI ENTRGESQQV RIMTKFVEVT QENTEELGFD
WIVTPFSVSN DRSTFLGGGT NYGTGSTSDS FTQSPGGVTG WPVNSGSDTI NGLVTGGNRT
GDYAITKNSV DNLLNSTNRS EASQKNAAPG IMSLTGIYDE GSFQMLMRGL SQKKGSDVLT
APSVTAKSGE TAKIEIIREF WYPTEYEPPE LPNSVGNSGY NNGYGYGNNG NIVDGLLGNQ
IQPQISSFPV TPATPGVFEM KPVGVTLEVV PTIGDNKYII DLNFKPSIVE FEGFVNYGSP
IQSTGVGSDG KPMSLTLTEN RIEQPIFSKR SVETSLFIYD GHTVAIGGLI TENVQTVEDK
VPIFGDLPLI GRFFRSNSDN HIKKNLMIFV TGQIIDATGQ PVRGNALPTA AAPESALPAS
EGLLPPM