Gene Mvan_5396 details


Gene Information

Locus tag: Mvan_5396
Symbol:
ID: 4648076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5777114
End bp: 5779981
Gene Length: 2868 bp
Protein Length: 955 aa
Translation table: 11
GC content: 68%
IMG OID: 639808872
Product: DNA topoisomerase I
Protein accession: YP_956173
Protein GI: 120406344
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01051] DNA topoisomerase I, bacterial
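
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5777114, 5779981)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values from this record, the two calls reproduce the listed 2868 bp gene length and 955 aa protein length.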


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.585871
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0823046
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTGACG AGGAACGCGG CAGCGGTAAG AACGGCGCCG AGCCGCGCAG GGGGAATGGC 
TCGTCGGTGC GGAGACTCGT CATCGTCGAG TCGCCGACCA AGGCGCGCAA GATCGCAGGT
TACCTGGGGT CCAACTACAT CGTCGAGTCC TCACGCGGGC ACATTCGGGA CCTGCCGCGC
AACGCCGCCG ACGTCCCGGC GAAGTACAAA TCCGAGCCGT GGGCCCGCCT GGGCGTCAAC
GTCGAGCACA ACTTCGAGCC GCTCTACATC ATCAGCCCGG ACAAGAAGAG CACCGTCGCC
GACCTGAAGG ACAAGCTCAA GAACGTCGAC GAGCTCTATC TGGCCACCGA CGGTGACCGC
GAGGGCGAGG CCATCGCCTG GCACCTGCTG GAGACGCTGA AACCGCGCAT CCCGGTCAAG
CGGATGGTGT TCCACGAGAT CACCGAGCCC GCGATCCGCG CGGCCGCCGA AGACCCGCGC
GACCTGGACA ACGACCTGGT CGACGCGCAG GAGACCCGAC GCATCCTCGA CCGTCTCTAC
GGCTACGAGG TCAGCCCCGT GCTGTGGAAG AAGGTCGCGC CGAAGCTGTC GGCGGGCCGG
GTCCAGTCCG TGGCCACCCG CATCATCGTG CAGCGCGAAC GCGAGCGGAT GGCGTTCCGC
AGCGCCGGGT ACTGGGACGT CACCGCCGAG CTGGACGCCA GCGTCTCCGA CGCGCAGGCC
AGCCCGCCCA CCTTCGTCGC AAAGCTCAAC ACGGTCGACG GGCGCCGCGT GGCCACCGGC
CGCGACTTCG ACTCCCTCGG CGCGGTCCGC AAGCCCGACG AGGTGCTCGT CCTCGACGAG
GCCGCCGCCA ACGCCTTGGC CACCGGTCTG CGGGGCGCGC AGCTGGCGGT CTCGTCGGTC
GAGCAGAAGC CGTACACGCG CCGCCCGTAC GCGCCGTTCA TGACGTCGAC GCTGCAACAG
GAGGCGGGCC GCAAGCTGCG GTTCACGTCG GAGCGCACGA TGAGCATCGC GCAGCGGCTC
TACGAGAACG GCTACATCAC CTACATGCGT ACCGACTCGA CCACGCTGTC GCAGTCGGCC
ATTGACGCCG CCCGCAATCA GGCCCGTCAG CTCTACGGCG AGGAATACGT CCACCCGACG
CCGCGCCAGT ACACCCGCAA GGTCAAGAAC GCGCAGGAGG CGCACGAGGC GATCCGCCCC
GCCGGTGACG TGTTCCAGAC CCCCGGTCAG CTGCACAGCC AGCTCGACAC CGACGAGTTC
CGCCTCTACG AGCTGATCTG GCAGCGCACC GTCGCCTCGC AGATGGCCGA TGCCCGCGGC
ACCACGCTGA GCCTGCGGAT CGCCGGAGCC GCCCCGGCGA CGACATTGGG CGGAGGTACC
GCGTCCGACG TCCAGGTGGT GTTCAACGCC AGCGGCCGCA CCATCACGTT CCCTGGCTTC
CTGAAGGCCT ACGTCGAGAG CATCGACGAC CTGGCCGGCG GCGAGGCCGA CGACGCCGAG
AGCAGGCTGC CCAACCTCAC CCAGGGTCAG CGGGTGGACG CCAAGGGGCT GACCGCCGAC
GGCCACACCA CCTCGCCGCC CGCGCGCTAC ACCGAGGCCT CTCTGATCAA GGCGCTGGAG
GATCTCGGCA TCGGCCGGCC GTCGACGTAC AGCTCGATCA TCAAGACCAT CCAGGACCGC
GGTTACGTCC ACAAGAAGGG CAGCGCGCTG GTTCCGTCGT GGGTGGCGTT CGCCGTCATC
GGTCTGCTCG AGCAGCACTT CGGGCGTCTG GTCGACTACG ACTTCACCGC CGCGATGGAG
GACGAGCTCG ACGAGATCGC AGCAGGGCAC GAGCGACGCA CCAACTGGCT CAACAACTTC
TACTTCGGTG GCGAGCACGG CGCGGACGGT TCGATCGCCC GCTCGGGCGG GCTCAAGAAG
CTCGTCGGTG GCAACCTCGA AGAGATCGAC GCGCGAGAAG TCAACTCCAT CAAGCTCTTC
GACGATGCCG AAGGCCGCGC GGTCAACGTG CGCGTCGGAC GCAACGGTGC CTATCTCGAG
CGCATGGTGG CCGATCCGGA CAACCCCGGT GAGCTCAAAC CGCAGCGGGC CAACCTCAAG
GACGAGCTGA CGCCTGACGA GCTGACCCTT GAGCTGGCCG AAAAGCTCTT CGCCACACCG
CAAGAGGGCC GTTCGCTGGG TGTCGACCCG GCGACCGGGC ACGAGATCGT CGCCAAGGAC
GGCCGTTACG GCCCGTATGT CACCGAGGTG CTTCCTGAAC CGCCCGACGA GGGCGAAGCG
GGCGCCACGG CGAAGAAGGG CAAGAAGCCG ACCGGGCCGA AGCCGCGTAC CGGTTCGCTG
CTTCGCTCGA TGGATCTGGA GACCGTCACG CTGGAGGACG CGCTTCGGCT GCTGTCGCTG
CCGCGGGTGG TCGGCGTCGA TCCGGCCAGC GGTGAGGAGA TCACCGCGCA GAACGGCCGG
TACGGCCCAT ATCTCAAGCG CGGCACCGAC TCCCGGTCTC TTGCCACCGA GGAGCAGATG
TTCGACATCA CCCTCGAGGA GGCGTTGAAG ATCTACGCCG AGCCGAAGCG TCGCGGTCGG
CAGGGGGCGG CGACCCCGCC GCTGCGCGAG CTCGGCGTCG ACCCGGTGTC GGAGAAGCCG
ATGGTGATCA AGGACGGCCG GTTCGGGCCC TACGTCACCG ACGGTGAGAC CAACGCCAGC
CTGCGCAAGG GCGACGACGT GCTGTCGATC ACCGACGCGC GGGCGTCCGA GCTGCTGGCC
GACCGCCGCG CCCGGGGTCC GGTCAAGAAG AAGGCCGTCA AGAAGGCGCC TGCCAAGAAG
ACGCCCGCGA AGAAGACCGC TGCCAAGAAG GCCGCGAAGA AGGCCTGA
 
Protein sequence
MADEERGSGK NGAEPRRGNG SSVRRLVIVE SPTKARKIAG YLGSNYIVES SRGHIRDLPR 
NAADVPAKYK SEPWARLGVN VEHNFEPLYI ISPDKKSTVA DLKDKLKNVD ELYLATDGDR
EGEAIAWHLL ETLKPRIPVK RMVFHEITEP AIRAAAEDPR DLDNDLVDAQ ETRRILDRLY
GYEVSPVLWK KVAPKLSAGR VQSVATRIIV QRERERMAFR SAGYWDVTAE LDASVSDAQA
SPPTFVAKLN TVDGRRVATG RDFDSLGAVR KPDEVLVLDE AAANALATGL RGAQLAVSSV
EQKPYTRRPY APFMTSTLQQ EAGRKLRFTS ERTMSIAQRL YENGYITYMR TDSTTLSQSA
IDAARNQARQ LYGEEYVHPT PRQYTRKVKN AQEAHEAIRP AGDVFQTPGQ LHSQLDTDEF
RLYELIWQRT VASQMADARG TTLSLRIAGA APATTLGGGT ASDVQVVFNA SGRTITFPGF
LKAYVESIDD LAGGEADDAE SRLPNLTQGQ RVDAKGLTAD GHTTSPPARY TEASLIKALE
DLGIGRPSTY SSIIKTIQDR GYVHKKGSAL VPSWVAFAVI GLLEQHFGRL VDYDFTAAME
DELDEIAAGH ERRTNWLNNF YFGGEHGADG SIARSGGLKK LVGGNLEEID AREVNSIKLF
DDAEGRAVNV RVGRNGAYLE RMVADPDNPG ELKPQRANLK DELTPDELTL ELAEKLFATP
QEGRSLGVDP ATGHEIVAKD GRYGPYVTEV LPEPPDEGEA GATAKKGKKP TGPKPRTGSL
LRSMDLETVT LEDALRLLSL PRVVGVDPAS GEEITAQNGR YGPYLKRGTD SRSLATEEQM
FDITLEEALK IYAEPKRRGR QGAATPPLRE LGVDPVSEKP MVIKDGRFGP YVTDGETNAS
LRKGDDVLSI TDARASELLA DRRARGPVKK KAVKKAPAKK TPAKKTAAKK AAKKA
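
The gene sequence begins with GTG while the protein sequence begins with M. That is expected under translation table 11, where GTG can serve as an initiation codon and is then read as methionine. A minimal sketch of this follows, assuming the standard codon-to-amino-acid assignments (which table 11 shares with table 1) and a simplified set of start codons (ATG, GTG, TTG; the full table 11 start set is larger). The `translate` helper is hypothetical and handles only plain A/C/G/T input.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplified assumption: only these three initiation codons are recognised.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS fragment; whitespace between 10-base groups is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("GTGGCTGACG AGGAACGCGG CAGCGGTAAG"))
```

Applied to the first 30 bases of the gene, this yields the first ten residues of the protein sequence, MADEERGSGK.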