Gene Amuc_0638 details


Gene Information

Locus tag: Amuc_0638
Symbol:
ID: 6274167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 751262
End bp: 752407
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 58%
IMG OID: 642612690
Product: glycosyl transferase group 1
Protein accession: YP_001877256
Protein GI: 187735144
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
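
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although the listed numbers are consistent with it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 751262
end_bp = 752407
gene_length_bp = 1146
protein_length_aa = 381


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons in a CDS, excluding one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1146
print(coding_codons(gene_length_bp))   # 381
```

Both results agree with the Gene Length and Protein Length fields, which supports the inclusive-coordinate reading.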


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.650689
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATCA AACGCCTCCG TCCCGTATTA GCCATAGCAG CGGACCTCCC TCTGGGCCGG 
CTGCTGACGT CTTTCCGTAA TAAGGACCGG CGTACGATTC CCTGGATTTT TTCCCTTTTT
CATGCTCTGG AATCTCAGGA AGATTTTGAC ATCCACTGGA TCACCCTCAG CAAAGCCGTT
TCTGCCCCGG AAACCATCCG GATGCGCAAC CAGACCATCC ACATTCTTCC CCTGGGCAGC
ATGGGCAGGA ATATCCTGAC GGCCCATTTC CTGACCGTCC GGAGAATACG CAAGACCCTG
AACGATATCC AGCCGGACCT CCTGCACGTA TGGGGCGTGG AGCAGGCTTA CGCTCTGGCG
GGGATTGCCT TCCGGGGAAA GAAGCTCCTT TCCTACCAGG GAGCCCTCAC CGCCTACTGC
CAGCGCGCTC CGCAGGCCTT CCTCCTCCAT ATGCAGGCCC TCTGGGAACG GATGGCCGTC
AAACATTATG ATCTTATCAC GTGCGAATCC CCCTGGGCGT GCGGCCGCGT TGCGGAAATT
GCCCCCCATG CCCGTCTATC CTGCATGGAA TACGGCGTGG AACCTTCCTT TTACCATCTT
GCCAGAAAAC CTTCCCCGGA ACCTTCCTGC CTCTTTGCCG GAACCATTTA CGAGTTGAAG
GGCATTTCCT ACCTGGTGGA GGCCTTTACG CATCCGTCCC TTTCCCATGT CCAGCTGTTC
ATTGCGGGCA ACGGAGCCCT CAGGGAAAGG CTGGAAGCCC TGTCCACTCC CAATATCCAC
TGGCTGGGCA GCATTTCCCG CGCAGAACTT CAGCAGCACC TTTCCACGGC GTGGTTCCTG
GTGCATCCCA CCCTGGGGGA TTGCTGCCCC AACATCGTGA AGGAGGCAAG AGTCATGGGC
CTTCCGGTAA TCACCACGGA AGAAGGCGGA CAGACTCAAT ATGTTCAGGA CGGCGTATCC
GGCTATATTG TCCCTGTCCG CAACAGCGCC GCCGTCAGGG AAGCCGCGCA GAAACTTTCC
GTCAGCCTGG ATAAAGCCAT GTCCATGGGA ATGGAGCGGC ATCAGGAATG CCGCCGCCTG
CTGGACGTAA AGCAGACAGT AACCGGGTGC CTGTCACGTT ATCATACCAT GCTGTATCCA
CGCTGA
 
Protein sequence
MPIKRLRPVL AIAADLPLGR LLTSFRNKDR RTIPWIFSLF HALESQEDFD IHWITLSKAV 
SAPETIRMRN QTIHILPLGS MGRNILTAHF LTVRRIRKTL NDIQPDLLHV WGVEQAYALA
GIAFRGKKLL SYQGALTAYC QRAPQAFLLH MQALWERMAV KHYDLITCES PWACGRVAEI
APHARLSCME YGVEPSFYHL ARKPSPEPSC LFAGTIYELK GISYLVEAFT HPSLSHVQLF
IAGNGALRER LEALSTPNIH WLGSISRAEL QQHLSTAWFL VHPTLGDCCP NIVKEARVMG
LPVITTEEGG QTQYVQDGVS GYIVPVRNSA AVREAAQKLS VSLDKAMSMG MERHQECRRL
LDVKQTVTGC LSRYHTMLYP R
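
The relationship between the two sequences above can be sketched in a few lines of Python. This is an illustrative example, not the method used to produce this record: it strips the spaces from the 10-base blocks, translates codon by codon, and stops at the first stop codon. The codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the tables differ in which codons may act as start codons), so the standard assignments are used here. The names `clean`, `translate`, and `gc_percent` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence, copied from the listing above.
first_blocks = "ATGCCGATCA AACGCCTCCG TCCCGTATTA"
print(translate(first_blocks))  # MPIKRLRPVL
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the Protein Length and GC content fields to be checked.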