Gene Amuc_1519 details


Gene Information

Locus tag: Amuc_1519
Symbol:
ID: 6274427
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1814271
End bp: 1817507
Gene Length: 3237 bp
Protein Length: 1078 aa
Translation table: 11
GC content: 52%
IMG OID: 642613578
Product: helicase domain protein
Protein accession: YP_001878121
Protein GI: 187736009
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
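
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive positions, and that Gene Length counts the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields.
START_BP = 1814271
END_BP = 1817507
GENE_LENGTH_BP = 3237
PROTEIN_LENGTH_AA = 1078


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is assumed to be a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # compare with Gene Length
print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under those assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.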


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00298818
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.00000196773
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAGCTTA TAGATAACAT AAACAAGACC CTGAAAGGGG ATTTGACCAG GGAGTTGCAC 
GACGGCAGCA AAGTTGCCAT TGCCGCTTCG TGCTTTTCCA TTTACGCCTT TGAGGAGCTG
AAAGCGCAGC TTAAAGACAT TGAGGAGTTG CGCTTTATCT TTACCTCCCC GACTTTCGTC
ACCGAGAAAG CCAACAAGCA AAAACGGGAG TTCTATATTC CACGGTTGAA CAGGGAGGGC
AATCTCTATG GCTCTGAGTT TGAGATAAAA TTGCGCAACG AGCTTTCGCA AAAAGCCATT
GCCAAAGAGT GCGCCGAATG GATCAGGCGG AAAGCCTGCT TCAAGTCCAA CTATACGGGA
GAGAACATGA TGGGCTTCGC CACCGTTGAC GACAAGGCGT ATATGCCCGT CACCGGATTT
ACCACGGTGG AATTGGGATG CGAGCGGGGC GACAACGCCT ACACGGTGAT AAACAAGTTC
GAGGCTCCTT TCTCGCAGCA ATACCTTTTC CTTTTCGATC AGCTTTGGAA TGACAGCGCG
AAGATGCAGG TAGTCACGGA CAAGATACTC GACAACATCA GCAATGTCTA TAAGGAGAAC
GCCCCCGACT TCATCTATTT CGTCACCTTG TACAATATCT TCCGTGAATT CCTTGACAAC
CTCTCCGAAG ACGACCTGCC CAACGAGGCA ACCGGATTCA AGGAAAGCCA GATATGGAAC
AAACTATATA ATTTCCAGAA GGATGCCTGC CTTGCCATCA TCAACAAGTT GGAAAAGTAC
AACGGCTGCA TCCTTGCCGA CAGCGTGGGA TTGGGCAAGA CGTTTACCGC CCTCTCGGTC
ATCAAGTATT ACGAGAACCG CAACAAGTCA GTGCTTGTGC TTTGCCCGAA GAAACTTAAC
GACAACTGGA TAACCTACAA GAGCAACTAT CTGAACAACC CCATCGCCCG TGACCGCCTG
CGCTATGACG TGTTCTTCCA TTCTGACCTG TCACGCCGGG GTGGAAAGAC CAACGGGCAG
GACTTGGACC ATATCAACTG GGGCAACTAC GACCTTGTGG TCATAGACGA GAGCCATAAT
TTCCGAAACG GCGGCAAGGT GACAACGGAT GGGAATGACG ACAACCCGCG CGAGAACCGC
TACCTGCAAC TGCTGAACAA GGTTATCCGG GCGGGGGTCA AGACAAAAGT GCTGATGCTC
TCGGCCACGC CCGTGAACAA CCGTTTCAAC GATTTGAAGA ATCAGATACA GCTTGCCTAC
GAAGGTGAGT GCGGCAGGAT GGATGCCCTG TTGAACACGT CTTCCGGCAT TGACGACATT
TTCCGTCAGG CTCAGAAGGA ATACAACATA TGGACGAAGC TCCCGCCGAA GGAGCGCACT
ACGAAAGAAC TGCTCAGCCG GTTGAGCTTC GACTTCTTTG AAGTACTCGA CAGCGTTACT
ATTGCCCGCA GCCGCAAGCA TATAGAGCAG TATTACAATA CGGCGGACAT CGGCAAGTTC
CCCATGCGGC TGGCTCCGCT GTCACGCCGT CCCAACCTGA CTGACCTCGA CACGGCTATC
AACTATAACG GGATTTACGA AATTGTCAGC CGCCTCAACC TTGCCATTTA TACCCCTTCC
GATTTCATTC TGCCAAGCCG GAGGAAAAAA TACATGGACG AGGATAACGA CTTCCACGGT
CGCGGCAGAG CCGGACGCGA GAAGGGAATA CGCCGCCTGA TGAGCATCAA CCTGCTTAAA
CGTCTGGAAA GCTCGGTCAA TTCTTTCCGC CTCACCCTTG AAAGAATCAG GAAGTTGATA
GCGTCAACCA TCGGACAGCT TGATGAGTTG AATCAGGGCG ACAGCTTTTC CATTGAATGG
GAGGATATTT CGCGGAATCT CGATGCCGAT GATCGAGAGG CGGATATGTT CATCGGTGGC
AAGAAGACAA GAATTGCCCT GCAGGATCTT GACCACATCA CATGGCGCGG CTACCTCAAA
AAGGATTTGG AGAACCTTGA TCTTCTGCTG CTCATGATTG GCTCCATCAC TCCGGAACAC
GACCGCAAGC TGCAACAGTT GATTGCCGAC TTGCGCGGCA AGTTTGCCTG CCCCATCAAT
GCAGGCAACA GGAAGGTGAT TATCTTCACG GCGTTCTCCG ATACCGCCGA ATACCTGTAC
GGTTGTCTGG CTGCTCCCAT TCTGGAAAAG TATGGGCTGC ATACGGCACT CGTCACCGGA
GACGTGGAAG CACGCAGTAC ATTGCGCCTT CCACAACGCG AGAAGCTCGA CTTCAATAAG
GTGCTTACGC TCTTCTCTCC AATTTCAAAG GAGAAGTCTG CCATTTATCC GCACATCCGG
GAGGAGATAG ACGTGCTTAT CGCTACCGAC TGCATCAGCG AGGGACAGAA CTTGCAGGAT
TGTGACTGTC TCATCAATTA CGACATCCAT TGGAATCCCG TGCGCATCAT TCAGCGTTTC
GGGCGTATCG ACCGCATTGG CTCGCGCAAC GAGGTGATTC AGCTTGTGAA CTACTGGCCG
GACGTGACCC TTGACGAATA CATCGACCTG AAAGGACGGG TGGAAGCCCG CATGAAGGTG
TCGGTGCTGA CAAGTACGGG CGATGACAAT CCCATATCGC CTGAAGAGAA AGGCGACCTT
GAATACCGGC GCGAGCAACT GAAACGCCTG CAGAGCGAGG TGGTGGACAT GGAGGAGATG
AACACGGGCG TTTCCATCAT GGATTTGGGG CTGAACGAGT TCCGTCTTGA CCTGCTTGCC
TGCCTGAAAG ACGCTCCCGG CCTGGAGCAT ATGCCTTTCG GTCTTCATGC CGTGGTACCC
GCAAGTGACG GTGCGCCTGC CGGAGCGGTG TTCGTATTGA AAAACCGTAA CAACGGCGTG
AATATTGACC ACAGGAACCG CCTGCATCCT TTCTATATGG TCTATATCTC GGAGAGCAGG
GAAGTGGTGG TCAATCATCT TTCACCCAAG GAGATGCTCG ATCGCCTGCG CTTCCTGTGC
AAAGGCAAGA CCGCACCGGA CATGGAGCTA TGCCGGGAGT TCAACAAGGC CACTTCCGAC
GGGAAGAACA TGCGGCGGTA TTCCGGACTG TTGGGCGATG CTATCTCCTC CATCATCCAT
GTGAAGGAAG AGAGCGACAT CGACAGTTTC CTGAGCGGCG TGCAAGGTTT GCTCTTCACG
GAAGAGATAC GCGGGCTGGA CGACTTTGAA TTAATCTGTT TTCTCGTCAT CAAGTGA
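
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal sketch of how one might strip the layout, compute a GC fraction, and run a basic sanity check on the reading frame. The helper names are hypothetical, and the start-codon check is a simplification: it accepts only ATG, although translation table 11 also permits alternative start codons.

```python
# Stop codons under NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if seq is in frame, starts with ATG, and ends with a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Demonstration on the first two blocks of the gene sequence only.
first_blocks = clean_sequence("ATGGAGCTTA TAGATAACAT")
print(len(first_blocks), gc_fraction(first_blocks))
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field of the record; the two-block demonstration is not representative of the whole gene.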
 
Protein sequence
MELIDNINKT LKGDLTRELH DGSKVAIAAS CFSIYAFEEL KAQLKDIEEL RFIFTSPTFV 
TEKANKQKRE FYIPRLNREG NLYGSEFEIK LRNELSQKAI AKECAEWIRR KACFKSNYTG
ENMMGFATVD DKAYMPVTGF TTVELGCERG DNAYTVINKF EAPFSQQYLF LFDQLWNDSA
KMQVVTDKIL DNISNVYKEN APDFIYFVTL YNIFREFLDN LSEDDLPNEA TGFKESQIWN
KLYNFQKDAC LAIINKLEKY NGCILADSVG LGKTFTALSV IKYYENRNKS VLVLCPKKLN
DNWITYKSNY LNNPIARDRL RYDVFFHSDL SRRGGKTNGQ DLDHINWGNY DLVVIDESHN
FRNGGKVTTD GNDDNPRENR YLQLLNKVIR AGVKTKVLML SATPVNNRFN DLKNQIQLAY
EGECGRMDAL LNTSSGIDDI FRQAQKEYNI WTKLPPKERT TKELLSRLSF DFFEVLDSVT
IARSRKHIEQ YYNTADIGKF PMRLAPLSRR PNLTDLDTAI NYNGIYEIVS RLNLAIYTPS
DFILPSRRKK YMDEDNDFHG RGRAGREKGI RRLMSINLLK RLESSVNSFR LTLERIRKLI
ASTIGQLDEL NQGDSFSIEW EDISRNLDAD DREADMFIGG KKTRIALQDL DHITWRGYLK
KDLENLDLLL LMIGSITPEH DRKLQQLIAD LRGKFACPIN AGNRKVIIFT AFSDTAEYLY
GCLAAPILEK YGLHTALVTG DVEARSTLRL PQREKLDFNK VLTLFSPISK EKSAIYPHIR
EEIDVLIATD CISEGQNLQD CDCLINYDIH WNPVRIIQRF GRIDRIGSRN EVIQLVNYWP
DVTLDEYIDL KGRVEARMKV SVLTSTGDDN PISPEEKGDL EYRREQLKRL QSEVVDMEEM
NTGVSIMDLG LNEFRLDLLA CLKDAPGLEH MPFGLHAVVP ASDGAPAGAV FVLKNRNNGV
NIDHRNRLHP FYMVYISESR EVVVNHLSPK EMLDRLRFLC KGKTAPDMEL CREFNKATSD
GKNMRRYSGL LGDAISSIIH VKEESDIDSF LSGVQGLLFT EEIRGLDDFE LICFLVIK