Gene Amuc_0939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0939 
Symbol 
ID6274226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1118530 
End bp1119645 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content58% 
IMG OID642612993 
Productglycosyl transferase family 8 
Protein accessionYP_001877552 
Protein GI187735440 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCC CCGATGCCAT GACAACCCCG CCCGTTCCCG CATCCCCGGA GAAATCACGC 
ATCCCCGTCA TGTTTTCCGC CACCGGCGGC TGGGGCCTGC CCCTGGGCGT AGCCATTCAC
ACCCTTTGCC TTCACGCCAG TTCCGGACGC TTTTATGACA TTCACATCGT TCACGACGGA
ATGGACGCGC GAATAATACA GGAGCTGAAC CAGGTTGCCG CCCCCTTTCC GCAGGTTTCC
CTTTCTTTCC TGCAACTTCC GGAAGAATTC CGCCATCTCT TTCAAAACGG CAACAAGGAC
CGCTACTCCC CCCTTGCGTA TGCCCGCCTG ATGGCCGGCA GCCTGTTCCC GCAGTACGGC
AGGATCGTTT ATCTGGACGC AGATGTCCTG CTGGCCGGAG ACGTAGCCGA ACTGTATTTT
TCCGATTTGC GGGGAGCTTC CGTAGCGGCG GCCGGAGACG GCCTGGCCCT CTGGAGCATT
GAAAAAGGAA CGATGCACCC CCATCTGGAA TATATGGGCA ACTACCTTTC CTTCCCCCTT
TCCTACTGCA ATTCCGGCGT CCTGGTGCTG GATCTGGACC AGATGCGCCG CCGCAACCTG
GAACACCGGC TGCTCCAACA GCTCCGGAGC CGCCCGGACC CCTTCCCCTA TCCGGACCAG
GACATCTTAA ATATCGCCCT GCACGGAGAC ATGACGACGC TGCCTCCGGA ATGGAACTTC
CAATTCCTGT CCTGGACCTG GGATGAAGAA AAAACACGGC TCCTGCGCGG AACCGAATTT
GAAAACGTTC CGACCATATC CTGCGGGCGT TCATGGAAAC TGCTGCACAT GGTAGGCCCG
GAAAAACCAT GGCGGCTCCC TGACACCCCC GGAACCATGG GGCAGTTCCA CTGGATCCTG
TACTCCTTTT TCTGGTGGCC GGAAGCAAAG AGGCTTCCCG TGTTCCGGGA GGAACTGGAT
GCGATTTCCC AGGGGCTGGC CCCGCTCCTC CAGCGCCATA TCCGCGGCCA GCAATGGAAA
CTGTTCTTCT CCCGGGGCCA TATTTTCCGG AAACGCCGGG ACAAGATCAG GTGGCTGAAA
AAATTGCTGT CCATTCTTGA CGGCAGAAAA CCGTAA
 
Protein sequence
MKFPDAMTTP PVPASPEKSR IPVMFSATGG WGLPLGVAIH TLCLHASSGR FYDIHIVHDG 
MDARIIQELN QVAAPFPQVS LSFLQLPEEF RHLFQNGNKD RYSPLAYARL MAGSLFPQYG
RIVYLDADVL LAGDVAELYF SDLRGASVAA AGDGLALWSI EKGTMHPHLE YMGNYLSFPL
SYCNSGVLVL DLDQMRRRNL EHRLLQQLRS RPDPFPYPDQ DILNIALHGD MTTLPPEWNF
QFLSWTWDEE KTRLLRGTEF ENVPTISCGR SWKLLHMVGP EKPWRLPDTP GTMGQFHWIL
YSFFWWPEAK RLPVFREELD AISQGLAPLL QRHIRGQQWK LFFSRGHIFR KRRDKIRWLK
KLLSILDGRK P