Gene Amuc_0637 details


Gene Information

Locus tag: Amuc_0637
Symbol:
ID: 6274169
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 750079
End bp: 751239
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 54%
IMG OID: 642612689
Product: glycosyl transferase group 1
Protein accession: YP_001877255
Protein GI: 187735143
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
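
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon, which encodes no residue


# Values copied from the record for Amuc_0637.
length = gene_length(750079, 751239)
print(length)                           # 1161
print(expected_protein_length(length))  # 386
```

Both results agree with the Gene Length and Protein Length fields of the record.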


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value: 0.948946
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCCT ATGAAATGGC CTGCAGCCTG AGCTCCCTGG ACTGGGATAT CCACGTCCTC 
ACCCGCCCGC TGGATTCACC GGATGAGTCC TTCCGCTCCC ACCAATACAG CCCTCTCACA
ACCCTGCCGG ACACCCTGAC GAAGGACAGT GCCCGTTTAA GGGAATACCT GGACGGGTTT
CTGAAAGATT TCCGGCCGGA TGTTATTATC TACCACTCCT GGGCGGACTG GTGCCGGGAA
GAATTGCTGG ACGCGGCACG GGATTCAGGC ATTCCGTTTT TCCTCCGCTC CCATGGTGCA
GCTACCAATT TCCGCTCTTT TTTCCGCTTC AATTACCCTC CTTTCTTCGG CCTGAAAAAA
TGGCTCTGCT CCTTTTTTCA AGTACGCAGG GATATCCTCA ACGTATGCCG GAAATCACCG
TTAAACCGTC TCGTTTTTCT CGATCCTTAC GGGACACTGT TCAAGAGCTT TGATTATTAC
TGCGCATCCA GAAGCAAACT TGCCCATTAC TCCTGCATTC CCAACACATT CCCGGCTCTG
AAAAGAACCG CTCCTTTTTT CCGGGAAAAA TACGGACTTT CCTCCGCCCC CGTTTTTACC
TGCCCTGCCG GCGCCAGCAT GAGGAAACGG CAGCTTCTCT TCATCCGCCA TGTGAAACGC
TCCCGTCTGC GGCATATCAT TTTTCTTTTT CTGATTCCCC AGCACAATGC CTACGCGGAA
CAAATGGAAC AAGCCATCGG GGATGACCCC AGATTCAGGA TTCTCTACAG GCTCCCCCGT
TTGGAAGTAG AAGCCGCCAT TATGGAAAGC GATGCCGTTT TCCTTTACTC TTATCAGGAA
CAGCAGCCCC TCTCCATTCT GGAGGCGATG TCATGCGGCG TTCCCTGGTT CGCTCCGGAC
GCAGGAGCTC TTTCCACCCT GGAGGGGGGA ATCGTCCTGA AAAACACTTC CCCCTCCGTG
CTGGAAAAAG CCGTGGAATC GTTGACGGAC GAAAAAACAC GCAAACTACT GGGGAGTAAA
GGCCGCCGGC AATGGGAAGC CTGTTTCGCC CCCGACGCAG TAAACCAGGA ATGGGAGCAA
CTGCTTTTTT CCTCCATCCG CCCGGAAGGA AAGCCTCCGT TCGCGTCTTC CATCGTCCGG
GAGCATTTAC CCACCTGCTA A
 
Protein sequence
MAAYEMACSL SSLDWDIHVL TRPLDSPDES FRSHQYSPLT TLPDTLTKDS ARLREYLDGF 
LKDFRPDVII YHSWADWCRE ELLDAARDSG IPFFLRSHGA ATNFRSFFRF NYPPFFGLKK
WLCSFFQVRR DILNVCRKSP LNRLVFLDPY GTLFKSFDYY CASRSKLAHY SCIPNTFPAL
KRTAPFFREK YGLSSAPVFT CPAGASMRKR QLLFIRHVKR SRLRHIIFLF LIPQHNAYAE
QMEQAIGDDP RFRILYRLPR LEVEAAIMES DAVFLYSYQE QQPLSILEAM SCGVPWFAPD
AGALSTLEGG IVLKNTSPSV LEKAVESLTD EKTRKLLGSK GRRQWEACFA PDAVNQEWEQ
LLFSSIRPEG KPPFASSIVR EHLPTC
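
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a sketch only, assuming the 10-base blocks and line breaks are purely cosmetic and that translation table 11 assigns the same amino acids to codons as the standard code (it differs in which start codons are permitted). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    dna = clean(seq)
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last line of the gene sequence shown above.
print(translate("ATGGCTGCCT ATGAAATGGC CTGCAGCCTG"))  # MAAYEMACSL
print(translate("GAGCATTTAC CCACCTGCTA A"))           # EHLPTC*
```

The two fragments translate to the first ten and last six residues of the protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.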