Gene Amuc_1788 details

Gene Information

Locus tag: Amuc_1788
Symbol:
ID: 6274670
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2175508
End bp: 2177031
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 60%
IMG OID: 642613851
Product: hypothetical protein
Protein accession: YP_001878387
Protein GI: 187736275
COG category: [S] Function unknown
COG ID: [COG1262] Uncharacterized conserved protein
TIGRFAM ID:
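
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue; both assumptions are inferred from the listed lengths, not stated by the record. The variable names are illustrative.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 2175508
end_bp = 2177031

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 nt. Assumption: the last codon is a stop codon,
# so it is not counted in the protein length.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1524 0 507
```

Under these assumptions the result agrees with the listed gene length (1524 bp) and protein length (507 aa).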


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0316215
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.102482
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGATC TTCCCCACAA CTTTGGCGAT TACACGCTGG TGGCATTTAT TGGCCGCACG 
CGCGGAGGAA TTCTTTATCA GGCCATCCAG CAGGGCATGG ACCGTTCCGT GTTTCTGGAG
TTGCTGGATC CGGACAATCC GGAGGGGGTG GGCGTGGAGG ATTTTCTGAT GAAGGCCCGC
ACGCGCGCCG CCATTAATGC TCCGGTGCTG GGCACCGTCT ATGAAGCGTC CCAGGCCCAG
GGATACTGGT TCGTCACCAG CGAACAGCTG GGAGGCTCAT CCCTCCAGTC CATGCTGGAC
CGCGGGCAGA CCCTGTCCAT GAAGGATTTG CTGAAGGTGA TTGAAACGGT CGGCAACGTG
TGCGGCAGGT ATGAGCGTCT GCGGACCGCT TTCAATATCA TGGAGCCGCG TCACATTTTC
CTGGATGACA AGTCTGCCGT GCGGCTGATG AATACGGCCG TGCCCGGTGA CTTTCACGAA
GAAACATCCC GCGACCAGAT GAAGCGCCTG GGCATCGACC TGCCTCCTCT GGTGACTCCG
GATGTGCCCG GAACCACGCG GATGCGCACC CTGCTGGAAT GGATGAGAGA AGGGCAGAAC
GGAAAGCCGA TGCAATGGGA CCAGGTTATG GAGCTGGTTG CTGCGGTCCG CGAACAGCTG
GGGCTTTCCC CTCGGGTTAC CACCCACCGC TATACCGTCC CCGTGGAATC CCGCAGAAAA
GCCGGAAAAA AGCTGCTTTG GGCCGGAATG GGCCTGCTGG GCGTGGGGAT AGCCGCGGCT
GCCGTTCTTC TTTTATTCCC CCGGGAGAAG GAAACCGTAT CCCGTCCCCA TGTTCCGGCT
CCTAAACATT ATCCGGATTT TTCCTCCAGC GATCATACGG AAGTCCGCGT AACTCTTCCC
GGAGGAGGGG AACTGATTGT GGGAGCCCAT GAAATTACCC TGGAATCCTA CCGCCTGTTT
TTGGACCAGT GGGCACGCCT CACTCCGGAG CTGAGGGAGG AATATTCCCA CCCGGACCAG
CCGGACAAGA GGACTGCCTC CCATATTCCC CAGGACTGGG AGGCTATGTG GAAGGCGGCT
TCCACTCCGG GAGGCAAATG GAAAGGCAGG AAAATAACAC CCCGCTCTCC CGTCGTCAAC
GTCACTTTCT GGGATGCCTG GGCGTATGCG AGCTGGAAGC CCGTGGCCCC CGGAGAACCG
CGCTACCGCC TGCCCGAGCG CGGGGAGTGG ATGGCTCTGG GGAATATGCT GGAAACGGGT
GAGAAGGGGG ACAGGACGCT GGTGATTGAC AGATACAGCA ATGATCATGA TTTAAAGACC
GGCGTGTGCG GCATGGCTTC CGGCGTGATG GAATGGACTT CTTCCATGGA AAAGGATCCT
GCGCGCGTGA AGGAGCCTCC GGGGCCGGTG GCCTGCGGCG GAGACTGGAG GCAGCCCGGC
ATCTCCAACC GGGTGGAATA CCTGCGCTCC CGCGGTGAAA GCCGGGACAA CCTGGGATTC
AGAATCGTGC GGAATGTCCT CTGA
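
A GC content figure such as the one reported for this gene can be recomputed from a sequence printed in 10-base groups. The following is a minimal sketch, not the database's own method: the function name `gc_percent` is hypothetical, and how the record rounds its value is not stated.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between the 10-base groups)
    is ignored, and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)
```

Passing the full gene sequence as one string (line breaks included) would give the value to compare against the GC content field.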
 
Protein sequence
MFDLPHNFGD YTLVAFIGRT RGGILYQAIQ QGMDRSVFLE LLDPDNPEGV GVEDFLMKAR 
TRAAINAPVL GTVYEASQAQ GYWFVTSEQL GGSSLQSMLD RGQTLSMKDL LKVIETVGNV
CGRYERLRTA FNIMEPRHIF LDDKSAVRLM NTAVPGDFHE ETSRDQMKRL GIDLPPLVTP
DVPGTTRMRT LLEWMREGQN GKPMQWDQVM ELVAAVREQL GLSPRVTTHR YTVPVESRRK
AGKKLLWAGM GLLGVGIAAA AVLLLFPREK ETVSRPHVPA PKHYPDFSSS DHTEVRVTLP
GGGELIVGAH EITLESYRLF LDQWARLTPE LREEYSHPDQ PDKRTASHIP QDWEAMWKAA
STPGGKWKGR KITPRSPVVN VTFWDAWAYA SWKPVAPGEP RYRLPERGEW MALGNMLETG
EKGDRTLVID RYSNDHDLKT GVCGMASGVM EWTSSMEKDP ARVKEPPGPV ACGGDWRQPG
ISNRVEYLRS RGESRDNLGF RIVRNVL
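
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so a standard codon table is used here. The function name `translate` is hypothetical, and the sketch assumes the input is already in frame and contains only A, C, G, and T.

```python
from itertools import product

# Amino acids for all 64 codons, in NCBI order with bases sorted T, C, A, G.
# These assignments are shared by translation tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":  # stop codon: end of translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGTTTGATC TTCCCCACAA CTTTGGCGAT"))  # prints: MFDLPHNFGD
```

The first ten residues match the start of the listed protein sequence, and the last line of the gene sequence (ending in the stop codon TGA) translates to RIVRNVL, matching its end.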