Gene Mvan_1801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1801 
Symbol 
ID4647437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1909344 
End bp1912460 
Gene Length3117 bp 
Protein Length1038 aa 
Translation table11 
GC content74% 
IMG OID639805289 
ProductUvrD/REP helicase 
Protein accessionYP_952629 
Protein GI120402800 
COG category[L] Replication, recombination and repair 
COG ID[COG0210] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.078646 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCGC CGCACACCGA ACTCAGCCCC GAGGCCCTGA GCCGACGGGA CGTCCGAGGC 
ACAGTCCGGG TGCTGGGCGG GCCCGGCACC GGTAAGAGCA GCCTGCTCGT CGACACCGCC
GCGGCCCACA TCGCGGCCGG CTGTGATCCG GAATCGGTTC TGCTGCTGAC GGGTTCGTCG
CGGCTGAGCG CCCAGGCCAG GGCGGCGATC ACGACGGCGC TGCTGGGCGC GGGTGCGCGC
AGCGCCGTGC GGGAGCCGCT GGTGCGCACC GTGCACTCGT ACGCCTTCGC GGTGTTGCGC
CTGGCCGCGC AGCGCAACGG GAGCCCGCCG CCGCGCCTGA TCACCAGCGC CGAGCAGGAC
GGGATCATCC GTGAGCTGCT GGCCGGCGAT GTCGAGGACG GCGACGCGTC GCCGGTGCGG
TGGCCGGAGC GGCTGCGGCC CGCGCTGAGC ACCGTCGGTT TCGCCACCGA GCTGCGCGAC
CTGATGGCCC GCTGCAGCGA GCGTGGTGTC GATCCGCTTG CGCTGCAACG CATCGGCCGG
GCGGCCGGCC GGCCGGAATG GCAGGCCGCA GGCCGGTTCG CGCAGGCCTA CGAGCAGGTG
ATGCTGCTGC GCGCGGCGGT GGGCATGGCC GCCCCGCAGG CCACGGTTCC CGCGCTGGGT
GCCGCCGAGT TGGTCGGGGC GGCGCTGGAG GCGTTCGCCA CCGACGCCGA TCTGCTGGCC
GCCGAACGGG CCCGGGTGCA GCTGCTGCTG GTCGACGACG CCCAGCATCT CGACCCGCAG
GCGGCCCGCC TGGTCGAGGT GCTGGCCACC GGGGCCGAGC TGGCGGTGAT CGCCGGGGAC
CCACACCAGA CGGTGTTCGG TTACCGGGGC GCCGATCCGG CCCTGCTGCG CGGGGAGGGG
CCGGCACTGA CGCTGACCCG TTCGCACCGC TGCGCGAACC CGGTCGCCGA CGCGATCGGC
GCGGTCGGCC GGCGGCTCCC CGGCGCCGAG GCCACCCGGG AGTTCACCGG CAGCGACGCG
CCGGGGTCGG TCACGGTGCA GATCGCGGCG TCACCGCACG CCGAGTCCGC GCTGATCGCC
GATGCGCTGC GGCGCGCGCA CCTCGTCGAC GGCGTGCCCT GGTCGCAGAT GGCGGTGATC
GTCCGGTCGG TTCCGCGGAT GGGCGCCGCG CTGGGGCGCG CGTTGACCGC GGCGGGGGTG
CCGCTGGACC TGCCGCAACC CGAGGTGCCG CTCGCCGAGC AGCCCGCGGT GCGGGCGCTG
CTGACGGTGT TGGAGGCCAC CGCCGACGGC CTCGACGGTG AGCGGGCGCT GGCCCTGGTC
ACCGGGCCGA TCGGGCGCGT CGACCCGATC TCGCTGCGGC AGTTGCGCCG CGCGCTGCGC
CGCGCGGCGC CCGAATCACC CGGGGGGTTC TCCGACCTTC TGGTCGACGC GCTGCAACGC
GACACCCCCG CACTTGCCGA CGGGCAGGCG CGGGCGCTGC GGCGCGTGTG CGCCGTGCTG
ACCGCGGCCC GGCGCAGTGC CCGCGAGGGC AGCGACCCGC GGCACACGCT GTGGCAGGCG
TGGCACCGGT CGGGGCTGCA GAAGCGTTGG CTGGCCGCCA GCGAGCGCGG TGGGCCCGCC
GGTGCCCAGG CCGACCGTGA CCTCGACGCG GTCACCGCGA TGTTCGACGT CGCCGAGCAG
TACGTCGCCC GTACCGCCGG GGCGTCGCTG CGCGGCCTGG TGGACCACAT CACGGCGCTG
GCACTGCCTC CGGCGCGCCG CGATGAAGCC GCGCCGGACG CGGTGGCTCT GCTCAGCGCG
CACTCGGCCC TCGGCCACGA GTGGGAATTC GTGGTGCTCG CCGGTGTACA GGAAGGGCTG
TGGCCCAATG TCTCTCCGCG CGGTGGGGTT CTGGCCACCC AGCAGCTGGT GGACGTGATC
GACGGGGTCT GCGCCCCGGG GCAACACACG TTGTCCAGCA GGGCTCCGCT GCTCGCCGAG
GAGTGGCGGC TGCTCATCGC CGCCATGGGC CGGGCCCGCA GCCGGCTGCT GGTGACCGCG
GTCGACAGCG ACTGCGGTGA CGACGCGATG TTGCCGTCGT CGTTCTGTCA CGAACTCGCC
GCGCTGGCCA CCGAGCCGCA GTCGCAGCCG GCTCCTCCGG TCCGGGCGCC CCGGGTGCTG
GCGCCGTCGG CCCTGGTGGG TCGGCTGAGA TCGGTGGTGT GCGCCGCACC CGGTGCCGTC
GACGACGTCG AGAGGGATTG TGCGGCAGCG CAGTTGGCCC GGCTTGCCGA AGCGGGGGTG
CACGGCGCCG ATCCCGCGTC GTGGTACGGG TCGCGGGAGT TGTCCAGCGC CGAGCCGCTG
TGGGAGGACG GCGAGCAGGT GGTGACACTG TCGCCGTCAA CCCTGCAGAT GCTGTCGGAC
TGCCCGCTGC GCTGGCTGCT CGAACGCCAC GGGGGCTCCC GCGGCCGCGA CGTGCGGTCC
ACCCTGGGCT CACTGGTGCA CGCCCTGGTC TCCGAATCGG GGCGGACCGA GTCGCAGTTG
CTCAACGGGC TCGAGAAGGT CTGGGAGGAG CTCCCTTTCG ACGCGCAGTG GTACTCCGAC
AACGAGCGGG TCCGGCATCT GGAGATGCTC AGCACATTCC TGAGGTGGCG CGAAGGCACC
CGGGGCGAGC TCACCGAGGT CGGCACGGAG ATCGAGGTCG ACGGTCAGAT CGCCGCGCCC
GACGGGGAAC TGCCCGCGGT CCGGCTGCGT GGGCGCCTCG ACCGGCTCGA GCGCGACTCC
GAGGGCCGAC TCGTGGTGAT CGACCTCAAG ACCGGCAAGA GTCCGGTCAG CAAGGACGAT
GCGCAGAGCC ACGCACAGCT GGCGATGTAC CAGTTGGCCG TGGCCGCAGG CCTGCTCGCC
GACGGTGACG AGCCCGGCGG AGGCCGGCTG GTCTACCTCG GCAAGACCAC CGGTGGTGGG
GCGACCGAAC GCCACCAGGA CGCGCTGACC CCCGACGGCC GCGCCGAATG GGACGAGCAG
GTGCACCGGG CCGCGGCGGC GACGCAGGGC CCGCAGTTCA CCGCGCGCGT CAACGACGGG
TGCGCACACT GTCCTGTCCG CGCCATGTGC CCGGCCCAGA ACAGGAGTGA CACATGA
 
Protein sequence
MSAPHTELSP EALSRRDVRG TVRVLGGPGT GKSSLLVDTA AAHIAAGCDP ESVLLLTGSS 
RLSAQARAAI TTALLGAGAR SAVREPLVRT VHSYAFAVLR LAAQRNGSPP PRLITSAEQD
GIIRELLAGD VEDGDASPVR WPERLRPALS TVGFATELRD LMARCSERGV DPLALQRIGR
AAGRPEWQAA GRFAQAYEQV MLLRAAVGMA APQATVPALG AAELVGAALE AFATDADLLA
AERARVQLLL VDDAQHLDPQ AARLVEVLAT GAELAVIAGD PHQTVFGYRG ADPALLRGEG
PALTLTRSHR CANPVADAIG AVGRRLPGAE ATREFTGSDA PGSVTVQIAA SPHAESALIA
DALRRAHLVD GVPWSQMAVI VRSVPRMGAA LGRALTAAGV PLDLPQPEVP LAEQPAVRAL
LTVLEATADG LDGERALALV TGPIGRVDPI SLRQLRRALR RAAPESPGGF SDLLVDALQR
DTPALADGQA RALRRVCAVL TAARRSAREG SDPRHTLWQA WHRSGLQKRW LAASERGGPA
GAQADRDLDA VTAMFDVAEQ YVARTAGASL RGLVDHITAL ALPPARRDEA APDAVALLSA
HSALGHEWEF VVLAGVQEGL WPNVSPRGGV LATQQLVDVI DGVCAPGQHT LSSRAPLLAE
EWRLLIAAMG RARSRLLVTA VDSDCGDDAM LPSSFCHELA ALATEPQSQP APPVRAPRVL
APSALVGRLR SVVCAAPGAV DDVERDCAAA QLARLAEAGV HGADPASWYG SRELSSAEPL
WEDGEQVVTL SPSTLQMLSD CPLRWLLERH GGSRGRDVRS TLGSLVHALV SESGRTESQL
LNGLEKVWEE LPFDAQWYSD NERVRHLEML STFLRWREGT RGELTEVGTE IEVDGQIAAP
DGELPAVRLR GRLDRLERDS EGRLVVIDLK TGKSPVSKDD AQSHAQLAMY QLAVAAGLLA
DGDEPGGGRL VYLGKTTGGG ATERHQDALT PDGRAEWDEQ VHRAAAATQG PQFTARVNDG
CAHCPVRAMC PAQNRSDT