Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_2006 |
Symbol | |
ID | 6274539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2437064 |
End bp | 2438134 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642614065 |
Product | integrase family protein |
Protein accession | YP_001878597 |
Protein GI | 187736485 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.162448 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATAC CTACTTCCTC TTTCCCAACT GACGCAGACC AGGCCGGAAT CTTCCTCATC CGTTCCCTGG GCATTCCGCC CATGGACGCT TTCCTTCTTT TAAAGGATCT CCTGGACACC AGCCGCGGAA GAGGCGACAG AATAACCCGG GCCAAACGCT GCATACGGCT GGGAGGAGAG GCTCTTGCCG ACAGGGAAAA AAGCGTATCG TTTTCCCAGG CTGTCCGCGC CAGCCTGGAA GCAAGGAAAC ACCGCCGTCC CCGCACGCTG CAGGAAATCC GCTATATGGC CGCCCGGATG ATGAAAAAAT GCCCGGAGCT GGCAAGGAAA CAGGTCCGTT CCATCACTCC GGAAGATTGC GGGCGTTATC TCCGCAAAAG CTTTCCCACT CCCCGCCAGC GGCACAAGGG GCGGCTGATC CTGAGCGGCA TCCTGAATTA TTCCCTGAAG CGCGGATGGT GCCGCAGAAA CGCGGCCTTT CTGGTTCCTC CCCCCATCCT CAGGGAAAAA CGCATCAGGG CCCTTTCCCT GTACGAGGCA AAGCGGCTTC TCCACACTGC GGAACAGTTG TTCCGAGGGG AATGCCTGCC GGCCTGCGCC CTGATGCTGT ACGCGGGTAT ACGCCCCCAC GAGGTCAAAA GGCTGACGTG GAAGCATATC AATCTGAAAT CCGGCCTGGT TTCACTGGCG CCCACCCATA CCAAAACGGG AGGGAGCCGC CATGTTTCCA TCCTTCCCGT GCTGGGTTCC ATCCTCAGCC GGATGTCTTC CGCCGGTTCC CCCGCCCGTT CCGTCTGCCC GCCCAACTGG GAAAAGAAAT GGAAGGAAGT AAGGCGCCGG TCCGGCATCC TGAAGAAAAG CGGATGGGTT CAGGACGTGC TGAGGCATAC CTACGCCTCC TACCACCTGG CCCATTTCTG CAATCAAAAC CTTCTCCAGA AGGAGATGGG ACACTCCTCC CCCTCCCTGC TGCTGGCCCG CTATCTTAAT ATGGAGGGCA TCACCTCCGC AACCGGCGCC ATGTTCTGGA CGCACAGCTT TGTTTCTCCC GCTCCGTTAA AGGAAGACTG A
|
Protein sequence | MNIPTSSFPT DADQAGIFLI RSLGIPPMDA FLLLKDLLDT SRGRGDRITR AKRCIRLGGE ALADREKSVS FSQAVRASLE ARKHRRPRTL QEIRYMAARM MKKCPELARK QVRSITPEDC GRYLRKSFPT PRQRHKGRLI LSGILNYSLK RGWCRRNAAF LVPPPILREK RIRALSLYEA KRLLHTAEQL FRGECLPACA LMLYAGIRPH EVKRLTWKHI NLKSGLVSLA PTHTKTGGSR HVSILPVLGS ILSRMSSAGS PARSVCPPNW EKKWKEVRRR SGILKKSGWV QDVLRHTYAS YHLAHFCNQN LLQKEMGHSS PSLLLARYLN MEGITSATGA MFWTHSFVSP APLKED
|
| |