Gene Amuc_0920 details


Gene Information

Locus tag: Amuc_0920
Symbol:
ID: 6274245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1099438
End bp: 1100859
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 58%
IMG OID: 642612974
Product: oxidoreductase domain protein
Protein accession: YP_001877533
Protein GI: 187735421
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
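
The start and end coordinates, gene length, and protein length listed above agree with one another, and a short sketch can confirm this. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is an uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1099438, 1100859)
print(length)                  # 1422
print(protein_length(length))  # 473
```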


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.746532
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value: 0.6877
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTATTT TCTCATCACG CAGACAATTT CTCAAATCTT TGGGGCTTGC GGCCGGAGCG 
GCTGCCGCCG GAAATGCCCT CCCTGGGAAG GCTGTGGAAA TCCCTGCCGG AGACCATCTC
TGGAAATCCG CCTCTCCGGC GGCTCCGAGG CCTTCCGGTT CCACATACAT GGGAGGGTTC
AAGGCTCCCC GGCTGGGTCG CATCAGGCTG GCCTTCATCG GCGTGGGAGG GCGCGGGTTC
TCCCACCTGG CGCAAATGTG CGTGATGGAT GGAGTGGAAA TCGTGGGCAT ATGTGATTTG
AAGGAAGAGT TGACGAAACG CGGCGTGGAT CGCGTGCTCT CCAGAATGGG GAAAAGCCCT
TTGGGCTATT CCGGCGGCGA TATGGAATAC CTGACCATGC TGAAGGAGCT GAAGCCGGAT
GCCGTCATCA TCAGTACGGA TTGGAGTTCG CATGCCAGAA TCGCCTGCGA CAGCATGAAG
CACGGCGCTC ACGCCTTTGT GGAAGTTCCT CTGGCCGTCT CTCTGGAGGA GCTCTGGAGC
CTGGTGGATA CCAGCGAGGC CACCAGGAAA CATTGCATGA TGATGGAAAA CGTCAACTAT
GGGCGGGATG AACTCATGTT CCTGAACATG GTCCGGCAGG GCGTCATCGG CGATTTGCTT
CACGGGGAGG CCGCGTATAT CCATTGCCTG GTGACGCAGC TGGGGGACAC GCGCGGGGAA
GGGGCCTGGC GGCCGGAATA TCATACCAGA ATCAATGGCA ACCTGTACCC CACCCACGGG
TTGGGGCCGG TGGCTCAATA TATGAATTTG GAGCGTGGAG AGGACCGTTT CTGCCGTGTG
GCGGCGTTCG CTTCTCCTGC TCTCGGGCGC AATGCCTACG CTAAAAAGCA TCTTCCCGCC
GATCACCGCT GGAACAATAC TCCATTCATC TGCGGTGACA TGAATACGGC TGTTGTCAAG
ACGCAGCTGG GGCGGACCAT TCTTGTCCAG CTGGATGAGA CGTCCCCCCG GCCTTACTCC
CGCGCCAACC TGATCCAGGG AACGGAGGGC ACGCTGGCTG GTTTCCCAAC CCGCGTGGCG
GGTGAAAAGC TGGGCAACGG CAATTATCAT GAATGGATTG AAGGCAGGGA AAAACTGGCC
GCTATTTATG AAAAATACGA TCATCCCCTC TGGAAACGCA TCGGGGAGCT GGCCACGAAA
ATGGGCGGTC ACGGCGGTAT GGACTTTGTG ATGCTTTCCC GCATCGTGGA ATGCCTCCGG
AACGGAGAAC CAATGGATCA GAACGTTTAC GAAGGAGCTT CCTGGTCTTC CCTGCTGCCG
TTGACAGCCC GTTCCATCGC CCAGGGCGGG ATGCCTGTGG AATTTCCGGA TTTTACCCGC
GGAGACTGGA AAACCACCAT GCCGCTGGCC GTGGTTTCAT GA
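
A GC content figure like the one in the gene information can be recomputed from the sequence. The sketch below is one way to do that, assuming the sequence is pasted in the 10-base grouped layout used here. The helper name `gc_percent` is hypothetical, and the record does not say how its rounded value was derived.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example on a short fragment; paste the full gene sequence to check the
# whole-gene value.
fragment = "ATGTCTATTT TCTCATCACG"
print(round(gc_percent(fragment), 1))
```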
 
Protein sequence
MSIFSSRRQF LKSLGLAAGA AAAGNALPGK AVEIPAGDHL WKSASPAAPR PSGSTYMGGF 
KAPRLGRIRL AFIGVGGRGF SHLAQMCVMD GVEIVGICDL KEELTKRGVD RVLSRMGKSP
LGYSGGDMEY LTMLKELKPD AVIISTDWSS HARIACDSMK HGAHAFVEVP LAVSLEELWS
LVDTSEATRK HCMMMENVNY GRDELMFLNM VRQGVIGDLL HGEAAYIHCL VTQLGDTRGE
GAWRPEYHTR INGNLYPTHG LGPVAQYMNL ERGEDRFCRV AAFASPALGR NAYAKKHLPA
DHRWNNTPFI CGDMNTAVVK TQLGRTILVQ LDETSPRPYS RANLIQGTEG TLAGFPTRVA
GEKLGNGNYH EWIEGREKLA AIYEKYDHPL WKRIGELATK MGGHGGMDFV MLSRIVECLR
NGEPMDQNVY EGASWSSLLP LTARSIAQGG MPVEFPDFTR GDWKTTMPLA VVS
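
The protein sequence can be derived from the gene sequence by codon translation. The sketch below is a minimal version that strips the grouping whitespace and translates codon by codon. It uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model). The names `CODON_TABLE`, `clean`, and `translate` are illustrative, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, with the first base varying slowest,
# paired with the standard amino acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGTCTATTT TCTCATCACG CAGACAATTT CTCAAATCTT TGGGGCTTGC GGCCGGAGCG"
last_line = "GGAGACTGGA AAACCACCAT GCCGCTGGCC GTGGTTTCAT GA"
print(translate(first_line))  # MSIFSSRRQFLKSLGLAAGA
print(translate(last_line))   # GDWKTTMPLAVVS
```

The last line can be translated on its own because the 1380 bases before it are a whole number of codons, so it begins in frame.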