Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1519 |
Symbol | |
ID | 6274427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1814271 |
End bp | 1817507 |
Gene Length | 3237 bp |
Protein Length | 1078 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642613578 |
Product | helicase domain protein |
Protein accession | YP_001878121 |
Protein GI | 187736009 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00298818 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.00000196773 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGCTTA TAGATAACAT AAACAAGACC CTGAAAGGGG ATTTGACCAG GGAGTTGCAC GACGGCAGCA AAGTTGCCAT TGCCGCTTCG TGCTTTTCCA TTTACGCCTT TGAGGAGCTG AAAGCGCAGC TTAAAGACAT TGAGGAGTTG CGCTTTATCT TTACCTCCCC GACTTTCGTC ACCGAGAAAG CCAACAAGCA AAAACGGGAG TTCTATATTC CACGGTTGAA CAGGGAGGGC AATCTCTATG GCTCTGAGTT TGAGATAAAA TTGCGCAACG AGCTTTCGCA AAAAGCCATT GCCAAAGAGT GCGCCGAATG GATCAGGCGG AAAGCCTGCT TCAAGTCCAA CTATACGGGA GAGAACATGA TGGGCTTCGC CACCGTTGAC GACAAGGCGT ATATGCCCGT CACCGGATTT ACCACGGTGG AATTGGGATG CGAGCGGGGC GACAACGCCT ACACGGTGAT AAACAAGTTC GAGGCTCCTT TCTCGCAGCA ATACCTTTTC CTTTTCGATC AGCTTTGGAA TGACAGCGCG AAGATGCAGG TAGTCACGGA CAAGATACTC GACAACATCA GCAATGTCTA TAAGGAGAAC GCCCCCGACT TCATCTATTT CGTCACCTTG TACAATATCT TCCGTGAATT CCTTGACAAC CTCTCCGAAG ACGACCTGCC CAACGAGGCA ACCGGATTCA AGGAAAGCCA GATATGGAAC AAACTATATA ATTTCCAGAA GGATGCCTGC CTTGCCATCA TCAACAAGTT GGAAAAGTAC AACGGCTGCA TCCTTGCCGA CAGCGTGGGA TTGGGCAAGA CGTTTACCGC CCTCTCGGTC ATCAAGTATT ACGAGAACCG CAACAAGTCA GTGCTTGTGC TTTGCCCGAA GAAACTTAAC GACAACTGGA TAACCTACAA GAGCAACTAT CTGAACAACC CCATCGCCCG TGACCGCCTG CGCTATGACG TGTTCTTCCA TTCTGACCTG TCACGCCGGG GTGGAAAGAC CAACGGGCAG GACTTGGACC ATATCAACTG GGGCAACTAC GACCTTGTGG TCATAGACGA GAGCCATAAT TTCCGAAACG GCGGCAAGGT GACAACGGAT GGGAATGACG ACAACCCGCG CGAGAACCGC TACCTGCAAC TGCTGAACAA GGTTATCCGG GCGGGGGTCA AGACAAAAGT GCTGATGCTC TCGGCCACGC CCGTGAACAA CCGTTTCAAC GATTTGAAGA ATCAGATACA GCTTGCCTAC GAAGGTGAGT GCGGCAGGAT GGATGCCCTG TTGAACACGT CTTCCGGCAT TGACGACATT TTCCGTCAGG CTCAGAAGGA ATACAACATA TGGACGAAGC TCCCGCCGAA GGAGCGCACT ACGAAAGAAC TGCTCAGCCG GTTGAGCTTC GACTTCTTTG AAGTACTCGA CAGCGTTACT ATTGCCCGCA GCCGCAAGCA TATAGAGCAG TATTACAATA CGGCGGACAT CGGCAAGTTC CCCATGCGGC TGGCTCCGCT GTCACGCCGT CCCAACCTGA CTGACCTCGA CACGGCTATC AACTATAACG GGATTTACGA AATTGTCAGC CGCCTCAACC TTGCCATTTA TACCCCTTCC GATTTCATTC TGCCAAGCCG GAGGAAAAAA TACATGGACG AGGATAACGA CTTCCACGGT CGCGGCAGAG CCGGACGCGA GAAGGGAATA CGCCGCCTGA TGAGCATCAA CCTGCTTAAA CGTCTGGAAA GCTCGGTCAA TTCTTTCCGC CTCACCCTTG AAAGAATCAG GAAGTTGATA GCGTCAACCA TCGGACAGCT TGATGAGTTG AATCAGGGCG ACAGCTTTTC CATTGAATGG GAGGATATTT CGCGGAATCT CGATGCCGAT GATCGAGAGG CGGATATGTT CATCGGTGGC AAGAAGACAA GAATTGCCCT GCAGGATCTT GACCACATCA CATGGCGCGG CTACCTCAAA AAGGATTTGG AGAACCTTGA TCTTCTGCTG CTCATGATTG GCTCCATCAC TCCGGAACAC GACCGCAAGC TGCAACAGTT GATTGCCGAC TTGCGCGGCA AGTTTGCCTG CCCCATCAAT GCAGGCAACA GGAAGGTGAT TATCTTCACG GCGTTCTCCG ATACCGCCGA ATACCTGTAC GGTTGTCTGG CTGCTCCCAT TCTGGAAAAG TATGGGCTGC ATACGGCACT CGTCACCGGA GACGTGGAAG CACGCAGTAC ATTGCGCCTT CCACAACGCG AGAAGCTCGA CTTCAATAAG GTGCTTACGC TCTTCTCTCC AATTTCAAAG GAGAAGTCTG CCATTTATCC GCACATCCGG GAGGAGATAG ACGTGCTTAT CGCTACCGAC TGCATCAGCG AGGGACAGAA CTTGCAGGAT TGTGACTGTC TCATCAATTA CGACATCCAT TGGAATCCCG TGCGCATCAT TCAGCGTTTC GGGCGTATCG ACCGCATTGG CTCGCGCAAC GAGGTGATTC AGCTTGTGAA CTACTGGCCG GACGTGACCC TTGACGAATA CATCGACCTG AAAGGACGGG TGGAAGCCCG CATGAAGGTG TCGGTGCTGA CAAGTACGGG CGATGACAAT CCCATATCGC CTGAAGAGAA AGGCGACCTT GAATACCGGC GCGAGCAACT GAAACGCCTG CAGAGCGAGG TGGTGGACAT GGAGGAGATG AACACGGGCG TTTCCATCAT GGATTTGGGG CTGAACGAGT TCCGTCTTGA CCTGCTTGCC TGCCTGAAAG ACGCTCCCGG CCTGGAGCAT ATGCCTTTCG GTCTTCATGC CGTGGTACCC GCAAGTGACG GTGCGCCTGC CGGAGCGGTG TTCGTATTGA AAAACCGTAA CAACGGCGTG AATATTGACC ACAGGAACCG CCTGCATCCT TTCTATATGG TCTATATCTC GGAGAGCAGG GAAGTGGTGG TCAATCATCT TTCACCCAAG GAGATGCTCG ATCGCCTGCG CTTCCTGTGC AAAGGCAAGA CCGCACCGGA CATGGAGCTA TGCCGGGAGT TCAACAAGGC CACTTCCGAC GGGAAGAACA TGCGGCGGTA TTCCGGACTG TTGGGCGATG CTATCTCCTC CATCATCCAT GTGAAGGAAG AGAGCGACAT CGACAGTTTC CTGAGCGGCG TGCAAGGTTT GCTCTTCACG GAAGAGATAC GCGGGCTGGA CGACTTTGAA TTAATCTGTT TTCTCGTCAT CAAGTGA
|
Protein sequence | MELIDNINKT LKGDLTRELH DGSKVAIAAS CFSIYAFEEL KAQLKDIEEL RFIFTSPTFV TEKANKQKRE FYIPRLNREG NLYGSEFEIK LRNELSQKAI AKECAEWIRR KACFKSNYTG ENMMGFATVD DKAYMPVTGF TTVELGCERG DNAYTVINKF EAPFSQQYLF LFDQLWNDSA KMQVVTDKIL DNISNVYKEN APDFIYFVTL YNIFREFLDN LSEDDLPNEA TGFKESQIWN KLYNFQKDAC LAIINKLEKY NGCILADSVG LGKTFTALSV IKYYENRNKS VLVLCPKKLN DNWITYKSNY LNNPIARDRL RYDVFFHSDL SRRGGKTNGQ DLDHINWGNY DLVVIDESHN FRNGGKVTTD GNDDNPRENR YLQLLNKVIR AGVKTKVLML SATPVNNRFN DLKNQIQLAY EGECGRMDAL LNTSSGIDDI FRQAQKEYNI WTKLPPKERT TKELLSRLSF DFFEVLDSVT IARSRKHIEQ YYNTADIGKF PMRLAPLSRR PNLTDLDTAI NYNGIYEIVS RLNLAIYTPS DFILPSRRKK YMDEDNDFHG RGRAGREKGI RRLMSINLLK RLESSVNSFR LTLERIRKLI ASTIGQLDEL NQGDSFSIEW EDISRNLDAD DREADMFIGG KKTRIALQDL DHITWRGYLK KDLENLDLLL LMIGSITPEH DRKLQQLIAD LRGKFACPIN AGNRKVIIFT AFSDTAEYLY GCLAAPILEK YGLHTALVTG DVEARSTLRL PQREKLDFNK VLTLFSPISK EKSAIYPHIR EEIDVLIATD CISEGQNLQD CDCLINYDIH WNPVRIIQRF GRIDRIGSRN EVIQLVNYWP DVTLDEYIDL KGRVEARMKV SVLTSTGDDN PISPEEKGDL EYRREQLKRL QSEVVDMEEM NTGVSIMDLG LNEFRLDLLA CLKDAPGLEH MPFGLHAVVP ASDGAPAGAV FVLKNRNNGV NIDHRNRLHP FYMVYISESR EVVVNHLSPK EMLDRLRFLC KGKTAPDMEL CREFNKATSD GKNMRRYSGL LGDAISSIIH VKEESDIDSF LSGVQGLLFT EEIRGLDDFE LICFLVIK
|
| |