Gene Amuc_2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2083 
Symbol 
ID6273865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2533640 
End bp2534818 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content54% 
IMG OID642614145 
Productglycosyl transferase group 1 
Protein accessionYP_001878673 
Protein GI187736561 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.565137 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACA TCGTCATCTA TCTTCCCAAT GAATCCCTGG GCAAACAGGG AGGCATGGAA 
CGGGCCACTC ATCAGCTGGC CGCCATGCTT GCCGGTGAAG GCCACCGGGT AACGCTCCTC
TGCAGAAACA AAAACAGGCT GGGGGAAGAA TACGTGCCTC CGACGGATCT CGTGTTCATT
CCCGCAGCCC TCAGCCGGAA GGAAGAACAG AATTTCCTGC TTAACCTTAT CCGGGAAAGA
GAAATTGAGA GCATCATTGA CCAGACGGAA GGCGGGATTG TGGGGCGGTG GGGAATCTTC
CGGCATCGCG GGCACATGAA CGGGGTTTCC GTCAAACTCA TTGCCGTACA GCACAGTTCC
CAATATACCT ATCTGAAACA TTACCGGACC GTCAACCGGA GGCCCTCCGG ACGCGGCCTC
ATCGGCAAGG CATGTTCATT TTTCTATCAT ACGCTTATAC TGGCATTAAA AAAATACAGG
GCAATTTTGC TTCAGCGCAG CCTGTTCCGG GAACTGGCTT CAGACTATGA CCGGATAGTG
ACGCTTTCGA AAGGGGGCAT TGAAGAATTC AAAAAACTGT GCCCCTCCGT CCCCGGGAAC
AAGCTCGTCT GCATTCCCAA TATCGTGGAA CCGGCCGTTC TCTCCAAGGA AGAGAGAAAA
GAACCGCGCT GCCTGTTTGT CGGCAGGCTG GACAACCCCT CAAAAGGAGT GGACAGGCTT
CTGCGCATAT GGGAAAAAGT GGAAAAAACA TGTCCGGAAT GGCATCTGGA CATTGTGGGC
GACGGACCGG ATGCAGATAT GCTTAAGGAT TCCGCCCAAA AACTGGGGCT TTCCCGAATT
GCCTTTCACG GCTTCCAGAA TCCGGAGCCT TACTATTCCA GAGCCTCCGT ATTCTGCATG
ACTTCCACGT TTGAAGGGTT CGGCCTTGTT CTTGTAGAGG CCATGCAGCA CGGGTGCGTT
CCTGTTGCCT TTGACAGCTA CCCTGCCGTC CGGGATATTA TCTCCCACGG GGAAAACGGC
ATCCTCGTTC CCCCCTTTCA GGAAGAAATT TACTCGAACG CCCTCACATC CCTTATCAAC
AATCCCGGCG AACTGGAAAA ATTCAGCCGG CACAGCCTCG TCACATCCAG AAACTTCAGC
TCCTCAAACC TGGCGGCCAG GTGGGCCGCC ATTCTGTAA
 
Protein sequence
MSNIVIYLPN ESLGKQGGME RATHQLAAML AGEGHRVTLL CRNKNRLGEE YVPPTDLVFI 
PAALSRKEEQ NFLLNLIRER EIESIIDQTE GGIVGRWGIF RHRGHMNGVS VKLIAVQHSS
QYTYLKHYRT VNRRPSGRGL IGKACSFFYH TLILALKKYR AILLQRSLFR ELASDYDRIV
TLSKGGIEEF KKLCPSVPGN KLVCIPNIVE PAVLSKEERK EPRCLFVGRL DNPSKGVDRL
LRIWEKVEKT CPEWHLDIVG DGPDADMLKD SAQKLGLSRI AFHGFQNPEP YYSRASVFCM
TSTFEGFGLV LVEAMQHGCV PVAFDSYPAV RDIISHGENG ILVPPFQEEI YSNALTSLIN
NPGELEKFSR HSLVTSRNFS SSNLAARWAA IL