Gene Amuc_0555 details

Gene Information

Locus tag: Amuc_0555
Symbol:
ID: 6275349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 654455
End bp: 655786
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 55%
IMG OID: 642612605
Product: protein of unknown function DUF21
Protein accession: YP_001877174
Protein GI: 187735062
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
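
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are inclusive 1-based coordinates and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 654455
end_bp = 655786
gene_length_bp = 1332
protein_length_aa = 443

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which is not translated.
codons = gene_length_bp // 3

print(span_bp == gene_length_bp)        # coordinate span matches gene length
print(gene_length_bp % 3 == 0)          # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus the stop codon
```

Under these assumptions all three checks print True: 655786 - 654455 + 1 = 1332 bp, and 1332 / 3 = 444 codons, one of which is the stop codon.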


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.149438
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCTGG CAGTTGCCGG CTTTATCCTT TTTTTGTTGT TGAATGCGTT TTTTGCGGCA 
GGCGAGTTCG CCTTGATGAA GGTTCGTGAA AGCCAGCTGC ACGCCGGGGA AGGTGTTCCG
GCCCGAACTC GGAAAAAACT GGCCCGGGCG CGGAAGGCTG CCAAGCATCC TGATCTTTAT
TTGGCCGCCT GCCAGGCGGG CATTACGCTT TCTTCCCTGG CGTTGGGATT CCTGGGAACG
TTTTTTGTAT CGGAACTGAC AGCTCCCTTT CTGGTTTCCC TGGGATTGGG AGGCATGGTT
TCCGTTTACG GAATCGCTCT GGCCGTTACA TTTATTTTCT TTGCCTGCTG CCAGGTGGTA
TTTGGGGAGT TTATCCCCAA GGCTATGGCG ATGCGCCAGC CGGACAAGGC CGCCCTGGCG
ACGGTTCCCC TGCTGTATTT CTTTTATACG GTGTTCAGAT ATACGGGTAT TCTTGGCCTG
ACGGGCGGAA TGGCGCGGTT TGTGCTGAAA TACCTGCTGG GCATAGACCC CCGTTCCACG
GCGTGCACGG TGCACAGCAC GGATGAATTG ATGTATCTGG TGGAAGAAAG CGAACGTTCC
CGCGAGCTGA CGAAGCAGGA GGCTGAAATT TCCAAAAATG CCCTTGAACT GAACGATATG
TGCGTCAAGG ACGTAATGAC CCCGCGCTCT GAAGTGGATG TGATGGATTT GACGGCTCCC
TTTGAGGAAA ACTGGGAGCT TGCCCGGAAA TCCCGCCACA CCCGGTTTCC GCTGGTGGAG
GGAGACCACC TGGATGAAGT GAAAGGCTGG GTGCATGTCA AAGATCTGCT CAAACTGGTA
GGACGGGAAA ATCCGGATCT GAGGAGCGTG CGGCGTGAAT TGCGCGTGGT GCCGGATACG
ATGCCCCTGG ACAGCCTTCT CACGTTCTTT CTGAAAGAAC ATGCCCACTT TGCCCTGGTA
GTGGATGAAT TCGGTGATTC TATCGGCCTG GTATTCCTGG ATGATGTGCT GGAACAGATT
GTGGGGGATG ACATTCAGGA CGAGTTTGAC CAGGAGGAAA TGCGGGAGTT TGTGAAAACC
GGCAAGGATA CATATGCCGT CAATGGGGCT ATTACCCTGT TTGACCTGGC AGATTACCTG
CCTGAAATGG ATTTGGATTG TCCGGGCGTT ACTACGCTGG GCGGTTACGT AATCAGCCGG
ATGGGTTATA TTCCGGAAGA AGGGGAGGAA TTGCGGATTG GCCGCTACCG GGCTGTGGTG
ACGGGGTCTG ACGGCAGGAG AATCACGCAG ATTCTGCTGG CCCGCCTTCC GGAGGAACAG
GAGGAGGAAT AG
 
Protein sequence
MMLAVAGFIL FLLLNAFFAA GEFALMKVRE SQLHAGEGVP ARTRKKLARA RKAAKHPDLY 
LAACQAGITL SSLALGFLGT FFVSELTAPF LVSLGLGGMV SVYGIALAVT FIFFACCQVV
FGEFIPKAMA MRQPDKAALA TVPLLYFFYT VFRYTGILGL TGGMARFVLK YLLGIDPRST
ACTVHSTDEL MYLVEESERS RELTKQEAEI SKNALELNDM CVKDVMTPRS EVDVMDLTAP
FEENWELARK SRHTRFPLVE GDHLDEVKGW VHVKDLLKLV GRENPDLRSV RRELRVVPDT
MPLDSLLTFF LKEHAHFALV VDEFGDSIGL VFLDDVLEQI VGDDIQDEFD QEEMREFVKT
GKDTYAVNGA ITLFDLADYL PEMDLDCPGV TTLGGYVISR MGYIPEEGEE LRIGRYRAVV
TGSDGRRITQ ILLARLPEEQ EEE
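
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the sequence is pasted as plain text with the whitespace grouping shown above, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in its permitted start codons). The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGATGCTGG CAGTTGCCGG CTTTATCCTT"))  # MMLAVAGFIL
print(translate("GAGGAGGAAT AG"))                      # EEE
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared against the reported GC content.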