Gene Amuc_1355 details


Gene Information

Locus tag: Amuc_1355
Symbol:
ID: 6275829
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1627047
End bp: 1628744
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 51%
IMG OID: 642613411
Product: Bacteriophage tail assembly protein-like protein
Protein accession: YP_001877960
Protein GI: 187735848
COG category: [R] General function prediction only
COG ID: [COG5525] Bacteriophage tail assembly protein
TIGRFAM ID:
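
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1627047
end_bp = 1628744

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1698
print(protein_length_aa)   # 565
```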


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00357201
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.000000000207538
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGAGACG TTGAAAAGGC TGCCGGATTT CTGAATCTTT TCGCGGATAA CGTATCCGCT 
GGAATGGACG TGGAGCCGAT AGAATGGATA CGTGAAAACG TGGTAGACCA GCAATCGGCG
CGGTCTACCC ATATTGATTT TACGTTGAGC CCGTTTCTGC TGGAACCCAT TGACAAATTC
CTGAATGACG GCACGGTCAG GCACATTAAC CTGATGGCTC CCACCGGTTC CGGAAAGTCG
ACATTGTTCG TCGGCTTGCT TAATTACCTC ATTGCCAATG ATGCGGGGAA TACCTTGGTC
GCTTTCCAGA ATGAACAGGA AACCTCTGAT TTCGCGGAAA CGAGGCTTTT TCCCACATTC
CGGGATAATA AGGCTTTAAA AAACTTGCTG CCGAAGAAAA GGCATGCGGC CAGGAAAACG
GAGATTCTTT TTCCGCACAT GAACTTGTGG ATGGTCTCCG CTACCAAGGG GCAGTTGCAG
TCGAAGTCCT GCCGGTATTT GATTGGTGAT GAAATGTGGG CGTGGGAAAA GGGAATGGTC
CGGGAGTTCC TGGCCCGCCA CCATGATCGG TTCAACCGTA AAATCCTGAT GGTTTCCCAG
GGAGGGGACA AGGGGACGGA CTGGGTTGAC GAGTACGGAA AGGGCCGCAT CCACCATTAC
CATTGGCAAT GCCCCGGCTG CCAGGGATGG AATGCCTATG ACTGGCGGGA TGTCATTTAC
AGCAAAGAAG AAAATATTGA TTGGGAACGT TTTAAGGAAT CCGTCAAAAT GGTTTGTCCC
CGGTGTTTGC ACGAAATAGA GGATACTGTT CACAACCGGC GCCATCTGGC CGGCGGCGGC
AAGTACGTGT TTTCCGGCAA TACGAGCGCC TTGCCGGAGA TCGTAAGCTA CAATTTTAAT
GCTTTGGCCT GTTACTGGGT ATCATGGGCG GATTTGGCCG TGGAATGGAT CCTGGCCAAT
AAAAAGAAGA GGAACGGGGA TATAGAACCG CTGAAAAAGT TTATCCAGAA GCGGCTTGCT
CAAAACATTG TGGATTTGGG CGAGAAAGAC GACGTGTTGA AAATCCCGCT CACGGCGGAA
AGCATGGAGG GGTACGCCGT GGAAAACGAA CGGGCGCGCT TCTTAACGGT AGATGTCCAG
AAAGGGCACT TTTGGCATAC TGTTTACGGC GTGGATGCCG GAGGGGCTTT CCATTTACTT
TCAGAGGGGC GTCTTGAAAC GTTGGAGGAT ATTGAACGTA AACAGGCGGA GTTTAACGTG
CCTGATCATT GCGTTGCTTT GGACTGCGCT TTTGATACGG ATGCCGTGCG GAAGATATGC
GGGCTTCATC ACTGGTTTTC CATGAATGGG ACGGTCAAGG AAGAATACGT GCATAAAATC
AGGGGGCGGG GGATTAAATT GATTTATGCG CCTCTTGAGC GGCACATGGT GGAGGGGATT
CAATGCCTTC ATTTCAACTT TTCTTCCCAG CGGGCGAAGG ACGTTCTTGC CGCGCGGATT
AAATCGGGGC ATTTCAAGGT TCCCCATGAT GTTTCTGCCG AGTACATCAA GCAGATGCAG
GCTGAAAGCA AACAGGAGTC CATAGACAAG CGCACGGGGA GGGTTTCTCT CAAATGGCTT
GCGTCCGGTA ATAATTCCCA CATGTGGGAC TGTTCCTGCA TGGCGGTGAT TTTTGCCATG
ATTCACCGGA TGATTTAA
 
Protein sequence
MRDVEKAAGF LNLFADNVSA GMDVEPIEWI RENVVDQQSA RSTHIDFTLS PFLLEPIDKF 
LNDGTVRHIN LMAPTGSGKS TLFVGLLNYL IANDAGNTLV AFQNEQETSD FAETRLFPTF
RDNKALKNLL PKKRHAARKT EILFPHMNLW MVSATKGQLQ SKSCRYLIGD EMWAWEKGMV
REFLARHHDR FNRKILMVSQ GGDKGTDWVD EYGKGRIHHY HWQCPGCQGW NAYDWRDVIY
SKEENIDWER FKESVKMVCP RCLHEIEDTV HNRRHLAGGG KYVFSGNTSA LPEIVSYNFN
ALACYWVSWA DLAVEWILAN KKKRNGDIEP LKKFIQKRLA QNIVDLGEKD DVLKIPLTAE
SMEGYAVENE RARFLTVDVQ KGHFWHTVYG VDAGGAFHLL SEGRLETLED IERKQAEFNV
PDHCVALDCA FDTDAVRKIC GLHHWFSMNG TVKEEYVHKI RGRGIKLIYA PLERHMVEGI
QCLHFNFSSQ RAKDVLAARI KSGHFKVPHD VSAEYIKQMQ AESKQESIDK RTGRVSLKWL
ASGNNSHMWD CSCMAVIFAM IHRMI
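
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, assuming plain whitespace is the only formatting to remove; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool. It is shown on the first row of the gene sequence only.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAGAGACG TTGAAAAGGC TGCCGGATTT CTGAATCTTT TCGCGGATAA CGTATCCGCT"
seq = clean_sequence(first_row)

print(len(seq))   # 60
print(seq[:3])    # ATG
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field in the record.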