Gene Information |
Locus tag | Mvan_1801 |
Symbol | |
ID | 4647437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 1909344 |
End bp | 1912460 |
Gene Length | 3117 bp |
Protein Length | 1038 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639805289 |
Product | UvrD/REP helicase |
Protein accession | YP_952629 |
Protein GI | 120402800 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0210] Superfamily I DNA and RNA helicases |
TIGRFAM ID | |
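The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(1909344, 1912460)
print(length)                  # 3117
print(protein_length(length))  # 1038
```

Both values agree with the Gene Length and Protein Length fields listed for Mvan_1801.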
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.078646 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCGCGC CGCACACCGA ACTCAGCCCC GAGGCCCTGA GCCGACGGGA CGTCCGAGGC ACAGTCCGGG TGCTGGGCGG GCCCGGCACC GGTAAGAGCA GCCTGCTCGT CGACACCGCC GCGGCCCACA TCGCGGCCGG CTGTGATCCG GAATCGGTTC TGCTGCTGAC GGGTTCGTCG CGGCTGAGCG CCCAGGCCAG GGCGGCGATC ACGACGGCGC TGCTGGGCGC GGGTGCGCGC AGCGCCGTGC GGGAGCCGCT GGTGCGCACC GTGCACTCGT ACGCCTTCGC GGTGTTGCGC CTGGCCGCGC AGCGCAACGG GAGCCCGCCG CCGCGCCTGA TCACCAGCGC CGAGCAGGAC GGGATCATCC GTGAGCTGCT GGCCGGCGAT GTCGAGGACG GCGACGCGTC GCCGGTGCGG TGGCCGGAGC GGCTGCGGCC CGCGCTGAGC ACCGTCGGTT TCGCCACCGA GCTGCGCGAC CTGATGGCCC GCTGCAGCGA GCGTGGTGTC GATCCGCTTG CGCTGCAACG CATCGGCCGG GCGGCCGGCC GGCCGGAATG GCAGGCCGCA GGCCGGTTCG CGCAGGCCTA CGAGCAGGTG ATGCTGCTGC GCGCGGCGGT GGGCATGGCC GCCCCGCAGG CCACGGTTCC CGCGCTGGGT GCCGCCGAGT TGGTCGGGGC GGCGCTGGAG GCGTTCGCCA CCGACGCCGA TCTGCTGGCC GCCGAACGGG CCCGGGTGCA GCTGCTGCTG GTCGACGACG CCCAGCATCT CGACCCGCAG GCGGCCCGCC TGGTCGAGGT GCTGGCCACC GGGGCCGAGC TGGCGGTGAT CGCCGGGGAC CCACACCAGA CGGTGTTCGG TTACCGGGGC GCCGATCCGG CCCTGCTGCG CGGGGAGGGG CCGGCACTGA CGCTGACCCG TTCGCACCGC TGCGCGAACC CGGTCGCCGA CGCGATCGGC GCGGTCGGCC GGCGGCTCCC CGGCGCCGAG GCCACCCGGG AGTTCACCGG CAGCGACGCG CCGGGGTCGG TCACGGTGCA GATCGCGGCG TCACCGCACG CCGAGTCCGC GCTGATCGCC GATGCGCTGC GGCGCGCGCA CCTCGTCGAC GGCGTGCCCT GGTCGCAGAT GGCGGTGATC GTCCGGTCGG TTCCGCGGAT GGGCGCCGCG CTGGGGCGCG CGTTGACCGC GGCGGGGGTG CCGCTGGACC TGCCGCAACC CGAGGTGCCG CTCGCCGAGC AGCCCGCGGT GCGGGCGCTG CTGACGGTGT TGGAGGCCAC CGCCGACGGC CTCGACGGTG AGCGGGCGCT GGCCCTGGTC ACCGGGCCGA TCGGGCGCGT CGACCCGATC TCGCTGCGGC AGTTGCGCCG CGCGCTGCGC CGCGCGGCGC CCGAATCACC CGGGGGGTTC TCCGACCTTC TGGTCGACGC GCTGCAACGC GACACCCCCG CACTTGCCGA CGGGCAGGCG CGGGCGCTGC GGCGCGTGTG CGCCGTGCTG ACCGCGGCCC GGCGCAGTGC CCGCGAGGGC AGCGACCCGC GGCACACGCT GTGGCAGGCG TGGCACCGGT CGGGGCTGCA GAAGCGTTGG CTGGCCGCCA GCGAGCGCGG TGGGCCCGCC GGTGCCCAGG CCGACCGTGA CCTCGACGCG GTCACCGCGA TGTTCGACGT CGCCGAGCAG TACGTCGCCC GTACCGCCGG GGCGTCGCTG CGCGGCCTGG TGGACCACAT CACGGCGCTG GCACTGCCTC CGGCGCGCCG CGATGAAGCC GCGCCGGACG CGGTGGCTCT GCTCAGCGCG 
CACTCGGCCC TCGGCCACGA GTGGGAATTC GTGGTGCTCG CCGGTGTACA GGAAGGGCTG TGGCCCAATG TCTCTCCGCG CGGTGGGGTT CTGGCCACCC AGCAGCTGGT GGACGTGATC GACGGGGTCT GCGCCCCGGG GCAACACACG TTGTCCAGCA GGGCTCCGCT GCTCGCCGAG GAGTGGCGGC TGCTCATCGC CGCCATGGGC CGGGCCCGCA GCCGGCTGCT GGTGACCGCG GTCGACAGCG ACTGCGGTGA CGACGCGATG TTGCCGTCGT CGTTCTGTCA CGAACTCGCC GCGCTGGCCA CCGAGCCGCA GTCGCAGCCG GCTCCTCCGG TCCGGGCGCC CCGGGTGCTG GCGCCGTCGG CCCTGGTGGG TCGGCTGAGA TCGGTGGTGT GCGCCGCACC CGGTGCCGTC GACGACGTCG AGAGGGATTG TGCGGCAGCG CAGTTGGCCC GGCTTGCCGA AGCGGGGGTG CACGGCGCCG ATCCCGCGTC GTGGTACGGG TCGCGGGAGT TGTCCAGCGC CGAGCCGCTG TGGGAGGACG GCGAGCAGGT GGTGACACTG TCGCCGTCAA CCCTGCAGAT GCTGTCGGAC TGCCCGCTGC GCTGGCTGCT CGAACGCCAC GGGGGCTCCC GCGGCCGCGA CGTGCGGTCC ACCCTGGGCT CACTGGTGCA CGCCCTGGTC TCCGAATCGG GGCGGACCGA GTCGCAGTTG CTCAACGGGC TCGAGAAGGT CTGGGAGGAG CTCCCTTTCG ACGCGCAGTG GTACTCCGAC AACGAGCGGG TCCGGCATCT GGAGATGCTC AGCACATTCC TGAGGTGGCG CGAAGGCACC CGGGGCGAGC TCACCGAGGT CGGCACGGAG ATCGAGGTCG ACGGTCAGAT CGCCGCGCCC GACGGGGAAC TGCCCGCGGT CCGGCTGCGT GGGCGCCTCG ACCGGCTCGA GCGCGACTCC GAGGGCCGAC TCGTGGTGAT CGACCTCAAG ACCGGCAAGA GTCCGGTCAG CAAGGACGAT GCGCAGAGCC ACGCACAGCT GGCGATGTAC CAGTTGGCCG TGGCCGCAGG CCTGCTCGCC GACGGTGACG AGCCCGGCGG AGGCCGGCTG GTCTACCTCG GCAAGACCAC CGGTGGTGGG GCGACCGAAC GCCACCAGGA CGCGCTGACC CCCGACGGCC GCGCCGAATG GGACGAGCAG GTGCACCGGG CCGCGGCGGC GACGCAGGGC CCGCAGTTCA CCGCGCGCGT CAACGACGGG TGCGCACACT GTCCTGTCCG CGCCATGTGC CCGGCCCAGA ACAGGAGTGA CACATGA
Protein sequence | MSAPHTELSP EALSRRDVRG TVRVLGGPGT GKSSLLVDTA AAHIAAGCDP ESVLLLTGSS RLSAQARAAI TTALLGAGAR SAVREPLVRT VHSYAFAVLR LAAQRNGSPP PRLITSAEQD GIIRELLAGD VEDGDASPVR WPERLRPALS TVGFATELRD LMARCSERGV DPLALQRIGR AAGRPEWQAA GRFAQAYEQV MLLRAAVGMA APQATVPALG AAELVGAALE AFATDADLLA AERARVQLLL VDDAQHLDPQ AARLVEVLAT GAELAVIAGD PHQTVFGYRG ADPALLRGEG PALTLTRSHR CANPVADAIG AVGRRLPGAE ATREFTGSDA PGSVTVQIAA SPHAESALIA DALRRAHLVD GVPWSQMAVI VRSVPRMGAA LGRALTAAGV PLDLPQPEVP LAEQPAVRAL LTVLEATADG LDGERALALV TGPIGRVDPI SLRQLRRALR RAAPESPGGF SDLLVDALQR DTPALADGQA RALRRVCAVL TAARRSAREG SDPRHTLWQA WHRSGLQKRW LAASERGGPA GAQADRDLDA VTAMFDVAEQ YVARTAGASL RGLVDHITAL ALPPARRDEA APDAVALLSA HSALGHEWEF VVLAGVQEGL WPNVSPRGGV LATQQLVDVI DGVCAPGQHT LSSRAPLLAE EWRLLIAAMG RARSRLLVTA VDSDCGDDAM LPSSFCHELA ALATEPQSQP APPVRAPRVL APSALVGRLR SVVCAAPGAV DDVERDCAAA QLARLAEAGV HGADPASWYG SRELSSAEPL WEDGEQVVTL SPSTLQMLSD CPLRWLLERH GGSRGRDVRS TLGSLVHALV SESGRTESQL LNGLEKVWEE LPFDAQWYSD NERVRHLEML STFLRWREGT RGELTEVGTE IEVDGQIAAP DGELPAVRLR GRLDRLERDS EGRLVVIDLK TGKSPVSKDD AQSHAQLAMY QLAVAAGLLA DGDEPGGGRL VYLGKTTGGG ATERHQDALT PDGRAEWDEQ VHRAAAATQG PQFTARVNDG CAHCPVRAMC PAQNRSDT
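The sequences above are displayed in space-separated blocks of 10 characters. The sketch below shows one way to strip that display formatting, compute a GC fraction, and check the start and stop codons. The helper names are hypothetical, and only short excerpts copied from the gene sequence are used, so the printed GC fraction describes the excerpt and not the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Drop the spaces and line breaks used to display 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

# First three blocks and last two blocks of the gene sequence listed above.
head = clean_sequence("ATGTCCGCGC CGCACACCGA ACTCAGCCCC")
tail = clean_sequence("ACAGGAGTGA CACATGA")

print(head[:3])                  # ATG
print(tail[-3:] in STOP_CODONS)  # True
print(gc_fraction(head))         # GC fraction of the 30-base excerpt only
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.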