Gene Mflv_1394 details


Gene Information

Locus tag: Mflv_1394
Symbol:
ID: 4972720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium gilvum PYR-GCK
Kingdom: Bacteria
Replicon accession: NC_009338
Strand:
Start bp: 1449866
End bp: 1452691
Gene Length: 2826 bp
Protein Length: 941 aa
Translation table: 11
GC content: 68%
IMG OID: 640455597
Product: DNA topoisomerase I
Protein accession: YP_001132664
Protein GI: 145221986
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01051] DNA topoisomerase I, bacterial
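
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 1449866
END_BP = 1452691
LISTED_GENE_LENGTH = 2826      # bp
LISTED_PROTEIN_LENGTH = 941    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)
```

Under these assumptions the computed values agree with the listed gene length and protein length.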


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.590373
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTGACG GGGACCGCGG CAGCGGCAAG AACGGGTCCG TGCGGCGACT CGTCATAGTC 
GAGTCGCCGA CCAAAGCGCG CAAAATCGCA GGTTATCTGG GGTCGAACTA CGTCGTCGAG
TCCTCCCGCG GGCACATCCG CGACCTGCCG CGCGCGGCCG CCGACGTCCC GGCCAAGTAC
AAGTCGGAAC CGTGGGCGCG CCTCGGCGTC AACGTCGACG CCGACTTCGA GCCGCTCTAC
ATCATCAGCC CAGATAAAAA GGCCACCGTC GCCGATCTGA AGGACAAGCT CAAGAACGTC
GACGAGCTCT ATCTGGCCAC CGACGGTGAC CGCGAGGGCG AGGCGATCGC CTGGCATCTG
CTGGAGACGC TGAAACCGCG CATCCCGGTC AAGCGGATGG TGTTCCACGA GATCACCGAG
CCCGCGATCC GCGCGGCCGC CGAGGACCCC CGCGACCTCG ACAACGACCT GGTCGACGCC
CAGGAGACCC GTCGCATCCT GGACCGTCTC TACGGCTACG AGGTCAGCCC CGTGCTGTGG
AAGAAGGTCG CGCCGAAGCT GTCGGCCGGC CGCGTGCAGT CGGTGGCGAC GCGCATCATC
GTCCAGCGCG AACGCGAGCG GATGGCGTTC CGCAGCGCCG GCTACTGGGA CGTCACCGCC
GAACTCGACG CCAGCGTGTC CGACGCGCAG GCCACACCGC CCACGTTCGT CGCGAAGCTC
AACACCGTCG ACGGCCGCCG CGTGGCCGCA GGCCGCGATT TCGACTCCCT CGGCGCGGTC
AAGAAGCCCG GTGAGGTGCT TGTCCTCGAC GAAGCCGCCG CGAACGCGCT GGCCGGTGGT
CTGCGCGGCG CCCAGCTGTC GGTCTCCTCG GTCGAGCAGA AGCCCTACAC CCGCCGCCCG
TACGCGCCGT TCATGACCTC GACGCTGCAG CAGGAAGCCG GCCGCAAGCT GCGGATGTCC
TCGGAGCGCA CGATGAGCAT CGCGCAGCGT CTGTACGAGA ACGGCTACAT CACCTACATG
CGTACCGACT CGACCACGCT GTCGCAGTCG GCCATTGACG CCGCACGCAA CCAGGCCCGC
CAGCTCTACG GCGAGGAGTA CGTCCACCCG ACGCCGCGCC AGTACACCCG CAAGGTCAAG
AACGCGCAGG AGGCCCACGA GGCCATCCGC CCCGCCGGTG ACGTGTTCCA GACCCCCGGC
CAGCTGCACA GCCAGCTCGA CACCGACGAG TTCCGTCTCT ACGAGCTGAT CTGGCAGCGC
ACCGTCGCCT CCCAGATGGC CGACGCCCGG GGTACGACGC TGAGCCTGCG GATCGCCGGA
GCTGCCACCA GCGGTGAGCA GGTCGTCTTC AACGCCAGCG GTCGCACGAT CACCTTCGCG
GGCTTCCTGA AGGCGTACGT CGAGAGCCTC GACGAGCAGG CCGGCGGCGA GGCCGACGAC
GCCGAGAGCC GGCTGCCGAA CCTGACCCAG GGTCAGCGTG TCGACGCCAA GGACCTGACC
GCCGACGGCC ACACCACTTC GCCGCCCGCG CGCTACACCG AGGCGTCGTT GATCAAGGCT
CTCGAAGATC TCGGAATCGG CCGGCCGTCG ACGTACAGCT CGATCATCAA GACCATCCAG
GACCGCGGCT ATGTCCACAA GAAGGGCAGC GCGCTGGTCC CGTCGTGGGT GGCGTTCGCC
GTGATCGGCC TGCTGGAGCA GCATTTCAGC CGGTTGGTCG ATTACGACTT CACCGCCGCG
ATGGAAGACG AGCTCGACGA GATCGCCGCC GGCAACGAGC GACGGACCAA CTGGCTCAAC
AACTTCTACT TCGGCGGCGA GCACGGAGTC GAGGGCTCGA TCGCCCGGGA GGGCGGGCTC
AAGAAACTGG TCGGCGGCAA CCTCGAAGAG ATCGACGCCC GAGAAGTCAA CTCCATCAAG
CTGTTCGACG ATTCGGAAGG TCGCGCGGTC AACGTCCGCG TCGGCCGCAA CGGCGCGTAT
CTGGAGCGGA TGGTTGCGGA CCCTGATAAT CCGGGTGAGC TGAAGCCGCA GCGCGCCAAC
CTCAAGGACG AGCTGACGCC CGACGAGCTG ACCCTGGAGC TGGCCGAGAA GCTGTTCGCC
ACACCGCAAG AGGGGCGTTC GCTGGGGATC GACCCGGCGA CGGGACACGA GATCGTCGCC
AAGGACGGGC GTTACGGACC GTACGTCACC GAGGTCCTGC CCGAGCCGCC CGATGACGGT
GAGGCCGGGG CGACGGCCAA GAAAGGCAAG AAGCCGACCG GCCCCAAGCC GCGTACCGGT
TCGCTGCTGC GGTCGATGGA CCTGGAGACC GTCACCCTCG ACGACGCGCT GCGGCTGCTC
TCGCTGCCCC GCGTGGTCGG TGTCGACCCC GCCAACGGTG AGGAGATCAC CGCGCAGAAC
GGCCGCTACG GCCCATATCT GAAGCGCGGC ACCGATTCCC GGTCGTTGGC GACCGAAGAG
CAGATGTTCG ACATCACGCT CGAGGAGGCC CTCAAGATCT ACGCCGAGCC CAAGCGCCGG
GGCCGCCAGG GTGCGGCGAC CCCGCCGCTG CGTGAACTCG GCGTGGACCC GGTCTCGGAG
AAGCCGATGG TGATCAAGGA CGGCCGCTTC GGCCCGTACG TCACCGACGG CGAGACCAAC
GCGAGTCTGC GCAAGGGCGA CGACGTCATG TCGATCACCG ATGCGCGCGC ATCCGAACTG
CTCGCCGACC GCCGTGCGCG CGGTCCGGTG AAGAAGAAGG CCGCGAAGAA GGCCGCAGTG
AAGAAGACCG CAGCGAAGAA GGCCGCCAAG AAGGCGCCGG CGAAGAAAGC GGCCAAGAAG
GCCTGA
 
Protein sequence
MADGDRGSGK NGSVRRLVIV ESPTKARKIA GYLGSNYVVE SSRGHIRDLP RAAADVPAKY 
KSEPWARLGV NVDADFEPLY IISPDKKATV ADLKDKLKNV DELYLATDGD REGEAIAWHL
LETLKPRIPV KRMVFHEITE PAIRAAAEDP RDLDNDLVDA QETRRILDRL YGYEVSPVLW
KKVAPKLSAG RVQSVATRII VQRERERMAF RSAGYWDVTA ELDASVSDAQ ATPPTFVAKL
NTVDGRRVAA GRDFDSLGAV KKPGEVLVLD EAAANALAGG LRGAQLSVSS VEQKPYTRRP
YAPFMTSTLQ QEAGRKLRMS SERTMSIAQR LYENGYITYM RTDSTTLSQS AIDAARNQAR
QLYGEEYVHP TPRQYTRKVK NAQEAHEAIR PAGDVFQTPG QLHSQLDTDE FRLYELIWQR
TVASQMADAR GTTLSLRIAG AATSGEQVVF NASGRTITFA GFLKAYVESL DEQAGGEADD
AESRLPNLTQ GQRVDAKDLT ADGHTTSPPA RYTEASLIKA LEDLGIGRPS TYSSIIKTIQ
DRGYVHKKGS ALVPSWVAFA VIGLLEQHFS RLVDYDFTAA MEDELDEIAA GNERRTNWLN
NFYFGGEHGV EGSIAREGGL KKLVGGNLEE IDAREVNSIK LFDDSEGRAV NVRVGRNGAY
LERMVADPDN PGELKPQRAN LKDELTPDEL TLELAEKLFA TPQEGRSLGI DPATGHEIVA
KDGRYGPYVT EVLPEPPDDG EAGATAKKGK KPTGPKPRTG SLLRSMDLET VTLDDALRLL
SLPRVVGVDP ANGEEITAQN GRYGPYLKRG TDSRSLATEE QMFDITLEEA LKIYAEPKRR
GRQGAATPPL RELGVDPVSE KPMVIKDGRF GPYVTDGETN ASLRKGDDVM SITDARASEL
LADRRARGPV KKKAAKKAAV KKTAAKKAAK KAPAKKAAKK A