Gene Amuc_2006 details

Gene Information

Locus tag: Amuc_2006
Symbol:
ID: 6274539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2437064
End bp: 2438134
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 59%
IMG OID: 642614065
Product: integrase family protein
Protein accession: YP_001878597
Protein GI: 187736485
COG category:
COG ID:
TIGRFAM ID:
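
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and assumes the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2437064, 2438134)
print(length, protein_length(length))  # prints "1071 356"
```

Both values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields in this record.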


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.162448
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATAC CTACTTCCTC TTTCCCAACT GACGCAGACC AGGCCGGAAT CTTCCTCATC 
CGTTCCCTGG GCATTCCGCC CATGGACGCT TTCCTTCTTT TAAAGGATCT CCTGGACACC
AGCCGCGGAA GAGGCGACAG AATAACCCGG GCCAAACGCT GCATACGGCT GGGAGGAGAG
GCTCTTGCCG ACAGGGAAAA AAGCGTATCG TTTTCCCAGG CTGTCCGCGC CAGCCTGGAA
GCAAGGAAAC ACCGCCGTCC CCGCACGCTG CAGGAAATCC GCTATATGGC CGCCCGGATG
ATGAAAAAAT GCCCGGAGCT GGCAAGGAAA CAGGTCCGTT CCATCACTCC GGAAGATTGC
GGGCGTTATC TCCGCAAAAG CTTTCCCACT CCCCGCCAGC GGCACAAGGG GCGGCTGATC
CTGAGCGGCA TCCTGAATTA TTCCCTGAAG CGCGGATGGT GCCGCAGAAA CGCGGCCTTT
CTGGTTCCTC CCCCCATCCT CAGGGAAAAA CGCATCAGGG CCCTTTCCCT GTACGAGGCA
AAGCGGCTTC TCCACACTGC GGAACAGTTG TTCCGAGGGG AATGCCTGCC GGCCTGCGCC
CTGATGCTGT ACGCGGGTAT ACGCCCCCAC GAGGTCAAAA GGCTGACGTG GAAGCATATC
AATCTGAAAT CCGGCCTGGT TTCACTGGCG CCCACCCATA CCAAAACGGG AGGGAGCCGC
CATGTTTCCA TCCTTCCCGT GCTGGGTTCC ATCCTCAGCC GGATGTCTTC CGCCGGTTCC
CCCGCCCGTT CCGTCTGCCC GCCCAACTGG GAAAAGAAAT GGAAGGAAGT AAGGCGCCGG
TCCGGCATCC TGAAGAAAAG CGGATGGGTT CAGGACGTGC TGAGGCATAC CTACGCCTCC
TACCACCTGG CCCATTTCTG CAATCAAAAC CTTCTCCAGA AGGAGATGGG ACACTCCTCC
CCCTCCCTGC TGCTGGCCCG CTATCTTAAT ATGGAGGGCA TCACCTCCGC AACCGGCGCC
ATGTTCTGGA CGCACAGCTT TGTTTCTCCC GCTCCGTTAA AGGAAGACTG A
 
Protein sequence
MNIPTSSFPT DADQAGIFLI RSLGIPPMDA FLLLKDLLDT SRGRGDRITR AKRCIRLGGE 
ALADREKSVS FSQAVRASLE ARKHRRPRTL QEIRYMAARM MKKCPELARK QVRSITPEDC
GRYLRKSFPT PRQRHKGRLI LSGILNYSLK RGWCRRNAAF LVPPPILREK RIRALSLYEA
KRLLHTAEQL FRGECLPACA LMLYAGIRPH EVKRLTWKHI NLKSGLVSLA PTHTKTGGSR
HVSILPVLGS ILSRMSSAGS PARSVCPPNW EKKWKEVRRR SGILKKSGWV QDVLRHTYAS
YHLAHFCNQN LLQKEMGHSS PSLLLARYLN MEGITSATGA MFWTHSFVSP APLKED
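
Both sequences are listed in blocks of 10 characters, 60 per line. A minimal sketch for stripping that layout from the gene sequence and computing its length and GC fraction follows; the function names are hypothetical, and the example input is only the first line of the gene sequence listing, not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing in this record.
first_line = "ATGAACATAC CTACTTCCTC TTTCCCAACT GACGCAGACC AGGCCGGAAT CTTCCTCATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full gene sequence, the length can be compared against the Gene Length field and the GC fraction against the listed GC content.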