Gene Amuc_0241 details


Gene Information

Locus tag: Amuc_0241
Symbol:
ID: 6275276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 298195
End bp: 299637
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 56%
IMG OID: 642612289
Product: Sel1 domain protein repeat-containing protein
Protein accession: YP_001876865
Protein GI: 187734753
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
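
The length fields in this record are consistent with its coordinates. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(298195, 299637)
print(length)                  # 1443
print(protein_length(length))  # 480
```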


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.381439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTGTT CCGGTATGAA GAAACATTGC AGATGGTGGC TTTTTGCCGG ATTGTGGGGA 
CTGTCCCTCG TTCAGCATCC ACTTCTTGCT TCTCAGAATA TTTCCTCTCC GGAGCCTGCC
GGGGAGGGAA ATTTTGAGGC TTACTTGTCC GGTCTCCGCG AAAGAGCTCT GAATGGGGAT
CCAGCTGCCG CCCGGGAGCT GTCTGTGCAT TATGACGTGG AGGGTAATGC TGCGGAAACG
TCCAAATGGA TGTCCGAGTA TGTTTCCTTG GCAGAGAAAA GGGCCAATAA CGGAGATGTG
GATTCCATGC TGGATTTGGG CAAGCTGTTT TATACAGGCA GCCGCCTGTA CCCAAAGAAC
CTGGAAAGGG CCCGGTACTG GTTTACCCGG GCTGCTGACA GCGGTAATGC CGCGGCACAG
TACCAGGTGG CTGTAATGGC TTCCCAGGGA GCAGGAGGAC CGAAGGATGA AGCAACGGCA
GCCCTTTATT ATAAAAAATC TCTCCAGACG TGGAAGAAAG AGGCGGATGA CGGTGATTCC
AAGGCGGCGT TATGGGCTGC CCTTGTTTAT GAACGAAAGC TGGTTCCGGA CAGTTCTCCG
GAAAAGTCAG TCCCATATCT TCTTCAGGCA GCGGAAAGCG GCAACCTGAC AGCACAAGGC
CTTCTGGCAT TTAAGTACCG GGATGGGCTG GGAGTGCCGC AGGATGCGGC CAAGGCCGTA
GAATGGTTTG AGAAAGCGGC CTCCCGTAAA GATTTGGGAG CCGTGATGGA ACTGGGCATA
ATGTTCCGGG ACGGCAAGTA TTTGCCCCCT GACCGGGAAA AGGCCTTCCA TTGGTTTGAA
AAAGGGGCGG AATGGAAGGA TCCGTACAGC ATGGCTGCCC TGGCGGATAT GCTGCTGGAG
GGAACTCCTT CCGCAGAACA GGCGGCCCGG GCCCTGGCTC TGTATCGTGA GGCTGCCGCC
GCCGGTTATT TCCCTGCGGC ACTAAAGGCC GCGGAGCTGC TCCAGAACGG GAAGGGCGGG
GAACTGGATG CGGATGAGGC CTACAGGCTG CTGCGGCGTG TGGCGGATGC TACAGGGGAT
CCCAAGGCCA TGTACATGCT GGCCCAGGTA TATTATACAC GGGGTGATGA GGCTCAGGGA
GATTCCCTGA TGAAAGCATC CGCCCAGGCT GCCTATTTGC CGGCCATGAA CCGCATGGCG
CGTCTCCATC TTCTGCCGGA CAGTTCACTG CCCTGGAATC CGGTTTTATC CTATTATTAT
TGGAACCAGG CTGGAGAAAT GGGGGATGAA AAGGCGGCTT CCGCCGCTTT TTGGCTGTTG
TGGGGCGGCT CAGGCATCTT TTTGCTGGCA ATATTTATTA TTGTCTGGCG TTTTCAGCGT
TTTGCCGCCA GAAGGCTTGC GGAACAGCAG AAACAGGAAC GGGAGGCCTC TGATGACGCA
TGA
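
The record reports a GC content of 56% for this gene. One way to check such a figure against the sequence above is sketched below. The helper `gc_content` is hypothetical; it assumes the sequence contains only A, C, G and T and that the spaces and line breaks used for display should be ignored.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace used for display."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, spacing kept as displayed.
print(gc_content("ATGACTTGTT CCGGTATGAA"))  # 0.4
```

Passing the full gene sequence and rounding the result to a whole percentage gives a value that can be compared with the reported one.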
 
Protein sequence
MTCSGMKKHC RWWLFAGLWG LSLVQHPLLA SQNISSPEPA GEGNFEAYLS GLRERALNGD 
PAAARELSVH YDVEGNAAET SKWMSEYVSL AEKRANNGDV DSMLDLGKLF YTGSRLYPKN
LERARYWFTR AADSGNAAAQ YQVAVMASQG AGGPKDEATA ALYYKKSLQT WKKEADDGDS
KAALWAALVY ERKLVPDSSP EKSVPYLLQA AESGNLTAQG LLAFKYRDGL GVPQDAAKAV
EWFEKAASRK DLGAVMELGI MFRDGKYLPP DREKAFHWFE KGAEWKDPYS MAALADMLLE
GTPSAEQAAR ALALYREAAA AGYFPAALKA AELLQNGKGG ELDADEAYRL LRRVADATGD
PKAMYMLAQV YYTRGDEAQG DSLMKASAQA AYLPAMNRMA RLHLLPDSSL PWNPVLSYYY
WNQAGEMGDE KAASAAFWLL WGGSGIFLLA IFIIVWRFQR FAARRLAEQQ KQEREASDDA
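
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows, assuming the sense strand is supplied 5' to 3' starting at the first codon. Table 11 assigns the same amino acids to codons as the standard code, so the sketch uses the standard codon assignments; it does not handle alternative start codons or ambiguous bases, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGACTTGTT CCGGTATGAA GAAACATTGC"))  # MTCSGMKKHC
```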