Gene Mvan_1802 details

Gene Information

Locus tag: Mvan_1802
Symbol:
ID: 4644058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1912457
End bp: 1915717
Gene Length: 3261 bp
Protein Length: 1086 aa
Translation table: 11
GC content: 73%
IMG OID: 639805290
Product: UvrD/REP helicase
Protein accession: YP_952630
Protein GI: 120402801
COG category: [L] Replication, recombination and repair
COG ID: [COG0210] Superfamily I DNA and RNA helicases; [COG2887] RecB family exonuclease
TIGRFAM ID:
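
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the stop codon. Both are assumptions about the database convention, not something the record states. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1912457, 1915717)
print(length)                  # 3261
print(protein_length(length))  # 1086
```

Under these assumptions the results agree with the Gene Length (3261 bp) and Protein Length (1086 aa) fields.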


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.103668
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0746176
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACAACGC CACGCTACAG CCCCGCCGAA CTGTCCAGGG CGCTGGGCCT TTTCGAACCC 
ACCGACGAGC AGGCCGCCGT GATCGCCGCG GAACCGGGGC CGCTGGTGGT GATCGCCGGC
GCCGGGGCGG GCAAGACCGA GACCATGGCG GCGCGGGTGG TGTGGCTGGT CGCCAACGGT
TACGCGCGCC CGGGCGAGGT ACTCGGCCTG ACGTTCACCC GTAAGGCCGC GGGCCAGCTG
TTGCGCCGGG TGCGGGCGCG GCTGGCGCGG CTGGCAGGCG CAGGTCTGAT GATTCCGACG
GGCGGCGCCG ACTTCGCCGA CGACCCCGTC ACCATCAGCA CCTACCACGC CTTCGCCGGC
AATCTGCTGC GCGAGTTCGG CCTGTTGTTG CCGGTTGAGC CAGACACCCG ACTGCTCGGC
GAAACAGAGT TGTGGCAGTT GGCCTTCCGG GTGGTGTGCG CACACCCCGC GGCGCTGGAC
ACCGAGAAGT CGCCGGCGTC GATCACCGAC ATGGTGCTGC GGCTGGCCGG CCAGCTGTCC
GAACACCTCG TCGACACCGC CGCGCTGCGC GACACCCACG TCGAACTCGA GCGGCTCGTG
CTGAACCTGC CCGCAGGCAG ATACCAGCGT GACCGCGGGC CCAGCCAGTG GCTGCTGCGC
ATGCTGGCCA CCCAGACCGA GCGCACCATG CTCGTGCCGC TCGTCGATGC GCTGCACCGG
CGGATGCGCG AGGAACGGGT CATGGACTTC GGTATGCAGA TGGCTTCGGC CGCCCGGCTG
GCCGCCCAGT TCCCGCAGGT CGGTGCGCAG CTGCGGCAGC GCTACCGCGT GGTGCTGCTC
GACGAGTACC AGGACACCGG GCACGCCCAG CGGGTGGCGC TGTCCTCGCT GTTCGGTGGC
GGCGCCGACG ACGGGCTCGC CCTGACCGCC GTCGGCGACC CGATCCAGTC CATCTACGGC
TGGCGGGGCG CCTCGGCGAC CAACCTGCCG CGCTTCACCA CCGATTTCCC GCTCTCGGAT
GGCACCCCGG CGCCGACGCT GGAACTGCGC ACCAGTTGGC GCAATCCGCC CGAGGTGCTG
CACCTGGCCA ACGAGGTGTC GGTGGATGCC CGCCGGCGCT CGGTGGCCGT GCGCGCGCTG
CAACCCCGTC CAGGCGCCGA GCCGGGGTCG GTGCGGTGCG CGTTGCTGTC CGACGTCGAA
CGTGAACGCG ACTGGGTCGC CGACCAGATC GCGCGCCGGT GGCACGCCGG CATCGAGGCC
GACGGGGCGG CGCCGACGGC CGCGGTCCTG GTGCGCCGCA ACGCCGACGC CGCCCCGATG
GCCGACGCGC TGACCCGCCG GGGCGTGGCG GTGGAGGTGG TCGGCCTGGC GGGCCTGCTG
GCCGTCCCTG AGGTCGCCGA TGTGGTCGCG ATGCTGAGGC TGGCGGCCGA CCCGACCGCA
GGCGCCGCCG CGGTGCGGGT GCTCACCGGG CCGCGGTGGC GCCTGGGCGC GCGCGACGTG
GCGGCGTTGT GGCGGCGTGC GGTCGCGCTG GTCGAGCCGG GCGCCTCCTC GGACGGGGAG
TCGGCGACCG CGGGGATCGT CGCGCAGCTG GGGCCCGACG CCGACGCCGC CTGCCTGGCC
GACGCGATCT GCGACCCCGG GCCCGCCACG GCGTACTCCG AGGCGGGGCA CGCCAGGATC
GTCGCGCTGG GCCGGGAGCT GACCGGGCTG CGGGCGCACC TGCAGAGCCC ACTGCCGGAC
CTGCTCGCCG AGGTGCGTCG CGTCCTGGGG ATCGACACCG AAGTGCGTGC GGCGCAACCG
GTCTCGGCGG GATGGTCGGG CACCGAGCAT CTCGACGCGT TCGGGGACGT CGTCGCGGAC
TTCGCGGCCC GTGGCGGGGC GACCGTTTCG GCGCTGCTCG GCTACCTCGA CATCGCGGAG
CAAGTGGAGA ACGGTCTGGC GCCGGCCGAG GTGACCGTGT CCGCCGACCG TGTGCAGATC
CTGACCGTGC ACGCCGCCAA GGGCCTGGAG TGGCAGATCG TCGCCGTGCC GCACCTGTCC
GGGCGGGTGT TCCCGTCCAC CGCGTCCCCA CGCACCTGGC TCACCGACGC TGCCGACCTG
CCGCCACTGC TGCGCGGCGA CTGCGCGACG GTGTCCGAGC ATGGGGTTCC GGTGCTCGAC
ACGTCCGATG TCAGCGACCG AAAAGGGCTG TCCGACAAGA TCTACGACCA CAAACGCAGC
CTGGAACAGC GGCGCACCGA CGAAGAGCGC CGGCTGCTGT ATGTGGCGAT CACCCGCGCC
GAGCACACCT TGTTGGTGTC GGGCCACCAC TGGGGTGCCA CCGAGTCGAA GCCGAGGGGG
CCGTCGACGT TCCTGTGCGA GCTCAAGGAC GTCATCGACG CGTCGGTGCA GTCGGGCTCG
CCCTGCGGTG CCGTCGACGT CTGGGCTGAC GCGCCCGCCG ACGGGGAACC GAACCCGCTG
CGTGACCGCG TCGTCGAGGC CGTGTGGCCC GAGGACCCGT TGGCGGGCCG CCGCGCACAC
ACCGACCATG GGGCACGGCT GGTGACCACT GCGATGTCCA CCGCCGGCGG GGCCGGTCCA
CAGCCCGACG TGCACGGCTG GACCGCCGAT GTCGACGCGC TGCTGGCCGA ACGGGAACTG
GCGGCGCAGC GACCCGTGCC GCCGCTGCCC CTGCAACTGT CGGTCAGTGC GATGGTCGAA
CTCGGCCGTG ATCCCGAGGC GGTCGCACAG CGGCTGCAGC GGCGGCTACC CCGGCGCCCC
GATGCACAGG CGTTGCTGGG CACGGCATTT CACGAATGGG TGCAGCGGTA CTTCCAGGCG
GAGAAGCTGT TCGATCTCGA TGACCTACCG GGGGCCGTCG ACGCCGACCG GCAGGACCGA
GGCGAGCTGG AGGAATTGCA GGCCGCGTTC GCGCTGTCCC CGTGGGCCGC CCGCACCCCG
ATCGATGTCG AGGTGCCGTT CGACATGATG ATCGCGGGGC GGGTGGTACG CGGCCGCATC
GACGCCGTGT TCGCCGACGG CGACGGGGTC ATGGTGGTGG ACTGGAAGAC CGGCGAGCCG
CCGGCAACCG AAGAAGAGTT GCAGCACAAC GCTGTTCAGC TTGCGGTGTA TCGCCTGGCG
TGGGCGCGGC TGCACGACTG CCCCGTGTCG TCGGTGCGTG CTGCCTTCCA CTACGTTCGG
TCGGGGCGCA CGGTCGTGCC CGACGGGCTG CCCGATGCCG ACGACCTGGC CGCGCTGCTG
GCCGACCCGC CCGCCGCCTG A
 
Protein sequence
MTTPRYSPAE LSRALGLFEP TDEQAAVIAA EPGPLVVIAG AGAGKTETMA ARVVWLVANG 
YARPGEVLGL TFTRKAAGQL LRRVRARLAR LAGAGLMIPT GGADFADDPV TISTYHAFAG
NLLREFGLLL PVEPDTRLLG ETELWQLAFR VVCAHPAALD TEKSPASITD MVLRLAGQLS
EHLVDTAALR DTHVELERLV LNLPAGRYQR DRGPSQWLLR MLATQTERTM LVPLVDALHR
RMREERVMDF GMQMASAARL AAQFPQVGAQ LRQRYRVVLL DEYQDTGHAQ RVALSSLFGG
GADDGLALTA VGDPIQSIYG WRGASATNLP RFTTDFPLSD GTPAPTLELR TSWRNPPEVL
HLANEVSVDA RRRSVAVRAL QPRPGAEPGS VRCALLSDVE RERDWVADQI ARRWHAGIEA
DGAAPTAAVL VRRNADAAPM ADALTRRGVA VEVVGLAGLL AVPEVADVVA MLRLAADPTA
GAAAVRVLTG PRWRLGARDV AALWRRAVAL VEPGASSDGE SATAGIVAQL GPDADAACLA
DAICDPGPAT AYSEAGHARI VALGRELTGL RAHLQSPLPD LLAEVRRVLG IDTEVRAAQP
VSAGWSGTEH LDAFGDVVAD FAARGGATVS ALLGYLDIAE QVENGLAPAE VTVSADRVQI
LTVHAAKGLE WQIVAVPHLS GRVFPSTASP RTWLTDAADL PPLLRGDCAT VSEHGVPVLD
TSDVSDRKGL SDKIYDHKRS LEQRRTDEER RLLYVAITRA EHTLLVSGHH WGATESKPRG
PSTFLCELKD VIDASVQSGS PCGAVDVWAD APADGEPNPL RDRVVEAVWP EDPLAGRRAH
TDHGARLVTT AMSTAGGAGP QPDVHGWTAD VDALLAEREL AAQRPVPPLP LQLSVSAMVE
LGRDPEAVAQ RLQRRLPRRP DAQALLGTAF HEWVQRYFQA EKLFDLDDLP GAVDADRQDR
GELEELQAAF ALSPWAARTP IDVEVPFDMM IAGRVVRGRI DAVFADGDGV MVVDWKTGEP
PATEEELQHN AVQLAVYRLA WARLHDCPVS SVRAAFHYVR SGRTVVPDGL PDADDLAALL
ADPPAA
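
Both sequences are printed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocked text might be cleaned and how a GC fraction could be computed from it. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed fraction describes that line alone, not the whole gene.

```python
def join_blocks(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGACAACGC CACGCTACAG CCCCGCCGAA CTGTCCAGGG CGCTGGGCCT TTTCGAACCC"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Applying the same two functions to the full gene sequence would yield a length and GC fraction to compare against the Gene Length and GC content fields.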