Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_5766 |
Symbol | |
ID | 4643723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 6155162 |
End bp | 6159094 |
Gene Length | 3933 bp |
Protein Length | 1310 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639809242 |
Product | ATP-dependent helicase HrpA |
Protein accession | YP_956537 |
Protein GI | 120406708 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1643] HrpA-like helicases |
TIGRFAM ID | [TIGR01967] ATP-dependent helicase HrpA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.378502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.114253 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGAAC CGTCCCGCAC CGATGTGCGC GCCCTGCGTG CGCGGCTGGA CGATCTGACG ATCAGCGACG CCGCCCGCCT CGGCCGTCGC CTGCGCCAGC TGCGCGACCC GTCGGAAGAA CAGCTGGGCA AGCTTGCGAA GCAGTTCGAC ACGGCGGAAG CGCTGGTGGC TTCCCGCCTC GCCGCGGTGC CGCAGATCAG CTATCCCGAC CTGCCGGTCA CCGAGCGGCG CGACGAGATC GCCGCGGCCA TCGCCGCCAA CCAGGTCGTG ATCGTGGCGG GCGAGACCGG CTCGGGCAAG ACCACGCAGC TGCCCAAGAT CTGCCTCGAG CTCGGGCGGG GTATCCGCGG CACCATCGGC CACACTCAGC CGCGCCGGCT GGCCGCCCGC ACCGTCGCGG CGCGTATCGC CGAGGAACTG GGCACCCCGC TGGGGGAGGC GGTCGGTTAC ACGGTGCGGT TCACCGATCA GGCCTCCGAC CGGACGCTGG TCAAGCTGAT GACCGACGGC ATCCTGCTCG CCGAGGTGCA GCGGGACCGC CGGCTGCTGC GCTACGACAC GTTGATCATC GACGAGGCCC ATGAGCGCAG CCTCAACATC GACTTCCTGC TCGGCTACCT GCGGCAGCTG CTGCCGCGGC GACCCGACCT GAAGGTGATC GTGACGTCGG CCACGATCGA GCCGGAGCGG TTCGCGGCGC ACTTCGCCGG AGCGCCGATC GTCGAGGTGA GCGGACGCAC GTATCCGGTC GAGATCAGGT ACCGGCCTTT GGAAGTCACG GTTCCCGGTC AGGACGGCGA GGATCCCGAC GACCCCGACC ACGAGGTCGT GCGCACCGAG CTGCGCGATC CCACCGAGGC CATCATCGAC GCGGTGGCCG AACTGGAGGC GGAGCCACCC GGAGACGTGC TGGTGTTCCT GTCCGGTGAG CGTGAGATCC GGGACACCGC GGAGGCTTTG CGGGCGGTGG TGGACCCCGG CCACACCGAG GTGTTGCCGC TCTATGCCCG GCTGCCGACC GCGGAACAGC AGAAGGTGTT CCACCCGGGC CGGACGGCAC GCCGAATTGT GTTGGCCACC AACGTGGCCG AGACGTCGCT GACGGTCCCG GGCATCCGGT ACGTCGTCGA CCCCGGGACG GCGCGCATTT CGCGGTACAG CCGCAGGACC AAGGTTCAGC GGCTGCCGAT CGAACCGATC TCGCAGGCCT CGGCCGCCCA GCGGGCCGGC CGGTCGGGCC GCACCGCGCC CGGCGTCTGC ATCCGGCTGT ATTCGGAGCA GGACTTCGAG GCCCGGCCCC GATACACCGA CCCGGAGATC CTGCGCACCA ACCTGGCCGC GGTGATCCTG CAGATGGCCG CGTTGGGGTT CGGCGACATC GAAGGTTTCG GGTTCCTCGA CCCGCCCGAT GCGCGCAGCA TCCGCGACGG CGTCGCGCTG CTGCAGGAGC TCGGCGCGTT CGACCAGCAG GCCGATCTCA CCGACATCGG ACGCAGGCTG GCGCAGATTC CCGTCGACCC ACGCCTGGGC CGGATGATCC TGCAGGCCGA CGCCGAGGGC TGTGTGCGCG AGATGCTGGT GCTGGCTGCG GCACTGTCGA TTCCGGACCC GCGGGAGCGG CCCGCCGACA AGGAGGAGGC GGCCCGGCAG AAGCACGCCC GCTTCGCCGA CCAGCATTCG GACTTCACGT CGTACCTGAA TCTGTGGCAC TACCTGACCG AGCAGCGAAA AGAACGCTCC GGCAGCTCCT TTCGGCGGAT GTGCCGCGAC GAGTTCCTGC ACTACCTGCG GATCCGGGAG TGGCAGGACC TGGTGGGTCA ACTGCGTGGC ATCTGCCGCG ATATCGGGAT CCGGGAACAG GACGAGCCGG CTGATCCGGC CGCGGTGCAC GCCGCACTGG CGGCAGGACT GTTGTCGCAC GTCGGGATGC GGGACACCGA CGGCCGCAGC TACCAGGGTG CCCGCAACGC CAAATTCGTC CTCGCCCCGG GCTCGGTGCT GAGCAAGCGT CCGCCGCGCT GGGTGGTGGT CGCCGACCTG GTGGAGACCA GCAGGCTGTT CGGCAGGACC GCGGCGCGGA TCGAGCCCGA GACCGTGGAG CGGGTCGCGG GGCACCTCGT GCAGCGGACC TACAGCGAGC CGCACTGGGA TGCCGAGCGC GGCGCGGTGA TGGCCTACGA GCGGGTCACC CTGTACGGCC TTCCGCTGGT GGCGCGCCGC CGGGTCGGAT ACGCGCAGGT GGACCCGGAA GCCGCGCGGG ACCTGTTCAT CCGGCATGCG CTGGTCGAGG GGGACTGGCA GACCAAACAC CACTTCTTCC GCGACAACGC AAGGCTGCGT GAGGAACTCG CCGAACTGGA GGACCGGGCC CGCCGTCGGG ACCTGCTGGT CGGCGACGAC GAGGTCTTCG CCTTCTACGA CGCCAGGGTG CCCGCGGACG TGGTGTCGGC CCGCCACTTC GACGGATGGT GGCGCAAGCA GAGGCACCGC ACACCCGGGC TGCTCACGAT GACCCGCGAC GATCTGCTGC GCAACGAGGC CGGAGCGGAG CAGCCCGACG CCTGGCAGGC CGGGGACCTG TCGTTGCCGC TGAGCTACCG ATACGACCCC GGCTCGGCCG ATGACGGTGT GACCGTGCAC GTTCCGGTGG ATGTGCTGGC CCGCCTCGGT GGCGACGAAT TCGCCTGGCA GGTGCCCGCG CTGCGCGAGG AGCTGCTCAC CGCGTTGATC AAGTCGCTGC CCAAGGACCT GCGCCGCAAC TTCGTGCCCG CGCCGGACAC CGCACGGGCG CTGCTGGCGT CCATCACTCC CGACAGCGGG CCGTTACTCG ACGCGATACA GCGTGAATTA CGGCGTCGCA CAGGAATTCT GGTGCCCATC GATGCGTTCG ACTTGGACAA GCTGCCGCCG CACCTGCGGG TCACGTTCGC GGTGGAGGCC GCCGACGGCA CCGTCGTGTC CCGCGGCAAG GATCTCGGCG AGCTGCAGCA GCAGCTCGCC GCGCCGTCCC GCCAGGCCGT CGCGGCCACC GTCGCAGGTG ATCTGGAGCG CACCGGTCTG CGCGACTGGC CGCCGGAGCT CGACGAGCTG CCCCGTGTAG TGGAACGGAA AGGCGCCACC GGGCACCTGG TCCGCGGCTA TCCCGCGTTG GTGGACGCGG GGAAGGCGGC CGACATCAAG GTCTTTGCGA CGAAGGCCGA ACAGGATGCG GCGATGGGCC CCGGCTGTAG GCGGCTGTTG CTGACCGCGG GGCCGTCGGT GACCAAAGGC GTTGAGCGGT CCCTGGATAC CCGCACACGC CTGGTGTTGG GCACCAACCC GGATGGATCC CTGGCGGCGC TGATCGACGA CTGCGCTGCG GCGGCGGTGC AGGTGCTGGT CCCCGCGCCG GCGTGGACGA GAGAAGAATT CGCCGCGGCT CGGAAAAGGC TGGCGGCCAA CATCGTTGCG ACCACCGCCG ACGTGGTGCG TCGGGTCGAG AAGGTGCTCG TCGCGCTGCA CGAGGTCGAG GTCGCGCTGC CGTCGAAGCC CACCCCTGCC CAGGCCGACG CGATCGCCGA CATCCGCGGG CAGCTGGCTC GTCTGGTTCC GCCGGGATTC GTCGCCGCGA CCGGCGCGGC CCGGCTGGGG GATCTGGCCC GCTACTTGAC CGCCATCGGG CGTCGGCTGG AGCGGCTGCC GCACGCACCC GCCGCCGACC GGGAGCGGAT GGAGCGCGTC GCGGCGGTCC AGGCCGAGTA CGACGATCTG CGCCGCGCGC TGTCGCCGGG TCGCGCGGCG GCGACCGATG TGCAGGACAT CGCCCGCATG ATCGAGGAGC TGCGGGTGAG CCTGTGGGCG CAGCAGCTCG GTACCGCCAG GCCGGTCAGC GAGCAACGCA TCTACCGGGC CATCGACGCC GTCGGCGTCG CCGGGCCGGG CGGGGGTCGG TGA
|
Protein sequence | MPEPSRTDVR ALRARLDDLT ISDAARLGRR LRQLRDPSEE QLGKLAKQFD TAEALVASRL AAVPQISYPD LPVTERRDEI AAAIAANQVV IVAGETGSGK TTQLPKICLE LGRGIRGTIG HTQPRRLAAR TVAARIAEEL GTPLGEAVGY TVRFTDQASD RTLVKLMTDG ILLAEVQRDR RLLRYDTLII DEAHERSLNI DFLLGYLRQL LPRRPDLKVI VTSATIEPER FAAHFAGAPI VEVSGRTYPV EIRYRPLEVT VPGQDGEDPD DPDHEVVRTE LRDPTEAIID AVAELEAEPP GDVLVFLSGE REIRDTAEAL RAVVDPGHTE VLPLYARLPT AEQQKVFHPG RTARRIVLAT NVAETSLTVP GIRYVVDPGT ARISRYSRRT KVQRLPIEPI SQASAAQRAG RSGRTAPGVC IRLYSEQDFE ARPRYTDPEI LRTNLAAVIL QMAALGFGDI EGFGFLDPPD ARSIRDGVAL LQELGAFDQQ ADLTDIGRRL AQIPVDPRLG RMILQADAEG CVREMLVLAA ALSIPDPRER PADKEEAARQ KHARFADQHS DFTSYLNLWH YLTEQRKERS GSSFRRMCRD EFLHYLRIRE WQDLVGQLRG ICRDIGIREQ DEPADPAAVH AALAAGLLSH VGMRDTDGRS YQGARNAKFV LAPGSVLSKR PPRWVVVADL VETSRLFGRT AARIEPETVE RVAGHLVQRT YSEPHWDAER GAVMAYERVT LYGLPLVARR RVGYAQVDPE AARDLFIRHA LVEGDWQTKH HFFRDNARLR EELAELEDRA RRRDLLVGDD EVFAFYDARV PADVVSARHF DGWWRKQRHR TPGLLTMTRD DLLRNEAGAE QPDAWQAGDL SLPLSYRYDP GSADDGVTVH VPVDVLARLG GDEFAWQVPA LREELLTALI KSLPKDLRRN FVPAPDTARA LLASITPDSG PLLDAIQREL RRRTGILVPI DAFDLDKLPP HLRVTFAVEA ADGTVVSRGK DLGELQQQLA APSRQAVAAT VAGDLERTGL RDWPPELDEL PRVVERKGAT GHLVRGYPAL VDAGKAADIK VFATKAEQDA AMGPGCRRLL LTAGPSVTKG VERSLDTRTR LVLGTNPDGS LAALIDDCAA AAVQVLVPAP AWTREEFAAA RKRLAANIVA TTADVVRRVE KVLVALHEVE VALPSKPTPA QADAIADIRG QLARLVPPGF VAATGAARLG DLARYLTAIG RRLERLPHAP AADRERMERV AAVQAEYDDL RRALSPGRAA ATDVQDIARM IEELRVSLWA QQLGTARPVS EQRIYRAIDA VGVAGPGGGR
|
| |