Gene Amuc_2065 details


Gene Information

Locus tag: Amuc_2065
Symbol:
ID: 6274745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2508222
End bp: 2509376
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 52%
IMG OID: 642614127
Product: acyltransferase 3
Protein accession: YP_001878656
Protein GI: 187736544
COG category: [S] Function unknown
COG ID: [COG3274] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
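
The start and end coordinates, the gene length, and the protein length listed above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2508222
end_bp = 2509376

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1155 384
```

Both results agree with the Gene Length and Protein Length fields in the record.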


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.393482
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACACCA ACCAATCATT GAACTCCCGC GGCGGCCACA TCGCCTGGGT GGACTTTCTG 
CGTATTCTGG CCTGCTTCCT CGTTGTCCTA GCCCATTGCT GTGATCCGTT CGTTGGAAGT
TTTGACGGTT CCTTCAACTT TAAATCCGCT GTCTTCTGGG GAAGTCTGGT ACGGCCGTGC
GTGCCTTTGT TTGTCATGAT CTCCGGAGTG TTGCTTTTTC CCGTCACCTT GGAAATGGGC
GCTTTTTACT CCAGGCGCCT CAAGAGGGTG CTGGTTCCGC TCATTGTCTG GTCACTGGCG
CTTCCCTTGC TCTACTTCGG ATACTTTGCC GCAGGCGTTC AAACGGCCAG CCCCAACATC
GTGATGGACA CTTATACTTG GAGTGCCACC GTCGGCAAGC TGTATACCTT CTTCTTCAAC
TTCAACTATG ATACCACGCC CCTGTGGTAT GTATACATGC TGGTAGGCCT GTACCTCTTC
ATGCCCATCA TGAGCGCGTG GCTGACGCAA GCCAGAAGGA AAGATGTGAA AATCTTCCTG
GGCATCTGGA TATTCAGCAT GACTCTCCCC TACATCCAGA TGCTTGCTCC GGCACTGGGT
TATGAGGGCA ATTACGGCAA CATGGGTATT CTGGGTGTTT GCGATTGGAA TCCGTACGGT
ATGTTTTATA ACTTTTCCGG ATTCCTGGGA TACATGGTGC TGGCGCATTA CCTGACCAAA
TACCCACTGG CCTGGAGCTG GAAAAAAACG CTGTCCATTA CTATTCCCCT CTTTTTGATT
GGTTTTGCCG TTACGTTCTT CGGCTTTCTG GAAACACAGA AGCACTTCCC CGGCCAGTAT
TCCAAGCTGG AAGTGCTCTG GTATTTCTCC GGAATCAATG TATTCCTGAT GACCTTTGCC
ATCTTTGCCG TCGTCAGCCG GCTCAGAATC AAGGCTGGTC CAGTGCTGAG CAAGGTGGCG
GCGCTTACTT TCGGCGTGTA TCTGTGCCAC TTCTTCTTTG TCCAGTGCTC CTATGACTTC
GTGAACTTCA TCGGGCTGGG AGGGCTGCCC TCCGCCGTGA AAATTCCGTT GATGGCCTGT
CTGGCCTCCG CTGTCTCCGC GGCGTTGGTA TGGCTGTTGA GCCTGAACAG GTGGACGCGC
AAAAGCATCA TGTAA
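
The GC content figure in the Gene Information section can, in principle, be recomputed from the gene sequence above. The sketch below shows one way to do that. The function name gc_percent is hypothetical, and the demo input is only the first 10-base block of the sequence, so its result is not the whole-gene value; how the database rounds its reported percentage is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # The record prints the sequence in space-separated blocks of 10 bases,
    # so whitespace and line breaks are removed before counting.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Demo on the first 10-base block only (1 G and 3 C out of 10 bases).
print(gc_percent("ATGAACACCA"))  # 40.0
```

Passing the full gene sequence (all lines pasted as one string) gives the whole-gene value for comparison with the reported figure.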
 
Protein sequence
MNTNQSLNSR GGHIAWVDFL RILACFLVVL AHCCDPFVGS FDGSFNFKSA VFWGSLVRPC 
VPLFVMISGV LLFPVTLEMG AFYSRRLKRV LVPLIVWSLA LPLLYFGYFA AGVQTASPNI
VMDTYTWSAT VGKLYTFFFN FNYDTTPLWY VYMLVGLYLF MPIMSAWLTQ ARRKDVKIFL
GIWIFSMTLP YIQMLAPALG YEGNYGNMGI LGVCDWNPYG MFYNFSGFLG YMVLAHYLTK
YPLAWSWKKT LSITIPLFLI GFAVTFFGFL ETQKHFPGQY SKLEVLWYFS GINVFLMTFA
IFAVVSRLRI KAGPVLSKVA ALTFGVYLCH FFFVQCSYDF VNFIGLGGLP SAVKIPLMAC
LASAVSAALV WLLSLNRWTR KSIM
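
The gene and protein sequences above can be cross-checked by translating the DNA codon by codon. The sketch below is a minimal illustration, not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in the start codons it allows, which does not matter here because the gene starts with ATG). The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence, copied from the record.
first = translate(
    "ATGAACACCA ACCAATCATT GAACTCCCGC GGCGGCCACA TCGCCTGGGT GGACTTTCTG"
)
last = translate("AAAAGCATCA TGTAA")

print(first)  # MNTNQSLNSRGGHIAWVDFL
print(last)   # KSIM*
```

The first 20 residues match the start of the protein sequence, and the final codons give KSIM followed by a stop, which matches its end.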