Gene Namu_0738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0738 
Symbol 
ID8446325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp810703 
End bp813639 
Gene Length2937 bp 
Protein Length978 aa 
Translation table11 
GC content72% 
IMG OID645039873 
ProductDNA topoisomerase I 
Protein accessionYP_003200141 
Protein GI258650985 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACTG GCACCTCCGC GAGCACCCGC AACGGCGGCG TGCGGCTGGT CGTCGTCGAG 
TCACCGAGCA AGGCGAAGAC CATCTCCGGT TACCTCGGCG ACGGCTACAT CGTCGAGTCC
TCGGTCGGGC ACATCCGCGA CCTGCCCCGC GGCGCCGCCG ACGTACCGGC CAAGTACAAG
GGCGAGCCCT GGGCCCGGCT GGGCGTGGAC ACCGAGCACG GCTTCGAGCC GCTCTACGTC
GTCTCCCCGG AGAAGAAGGC CCAGGTCGCT AAGCTCAAGT CGCTGCTGGC CGGGGCCGAC
GAGCTCTACC TGGCCACAGA CGAGGACCGC GAGGGCGAGG CCATCGCCTG GCATCTGCTG
GAGACCCTCA AGCCCAAGGT CCCGGTCAAG CGGATGGTCT TCCACGAGAT CACCCCGGCC
GCGATCCGCG AGGCCGCCGC CAACCCGCGC GCCCTGGACG AGAACCTGGT CGACGCCCAG
GAGACCCGCC GCATCCTGGA CCGGTTGTAC GGCTACGAGG TCTCCCCGGT GCTGTGGAAG
AAGGTGATGC CCAAGCTGTC CGCCGGTCGC GTGCAGTCGG TGGCCACCCG CATCATCGTG
CAGCGCGAAC GCGAGCGGAT GGCGTTCCGC TCGGGCACCT ACTGGGGCCT GGACGCGCTC
ATGTCGCCGG CCGGCACCGG GGCCGAGCCG TTCAAGTCGG CGCTGAGCAC CGTGGACGGG
CGCCGGCTGG CCGCCGGCCG CGACTTCGAC CCGGCCACCG GCGGGCTCAA GGCCGACGCC
GACGTGCTGC TGCTGGACGA GGACGGGGCG CGCACCCTGG CCACCGCGCT GGCCGGCGGC
ACCGCCACCG TCACCTCGGT GGAGGAGAAG CCCTACACCC GCAAGCCCTA CCCGCCGTTC
ATGACCTCCA CCCTGCAGCA GGAGGCCGGC CGCAAACTGG GGTTCAACTC CGAGCGCACC
ATGCGCACCG CGCAGCGGCT GTACGAGAAC GGCTTCATCA CCTACATGCG AACCGACTCG
ACCACCCTGA GCTCGTCGGC CCAGGAGGCC GCCCGGGCCC AGGCCCGCGA GCTGTACGGG
CCGGAGTACG TGCCGCCGAC CCCGCGGCAG TACACCCGCA AGGTCAAGAA CGCCCAGGAG
GCGCACGAGG CCATCCGCCC GGCCGGCGAC AACTTCCGCA CCCCGGGTCA GGTCGCCAAC
CAGATCTCCG GCGACGAGTA CCGGCTCTAC GAGCTGATCT GGCAGCGCAC CATCGCCTCG
CAGATGGTCG ACGCCCGCGG CCTGACCCTG TCGGTCAAGA TCGCCGCGAC CGCGCGGGAG
CAGGAGTGCG TGTTCAGCGC GTCCGGCCGC ACCATCACCT TCCCCGGGTT CCTGCGGGCC
TACGTGGAGA CGGTGGACGC GGAGGCCGGC GGCGAGGCCG ACGACGCCGA ACGGCGGCTG
CCCAAGCTGG AGACCGGGCA AAAGCTCGAC ATCCGTGACC TGATCCCGGC CAGCCACGTG
ACCACCCCGC CGGCCCGGTA CACCGAGCCG TCGCTGATCG GCGCGTTGCA GGACCTGGGC
ATCGGCCGTC CCTCCACCTA CACCTCGATC ATCCGCACGA TCATCGACCG CGGGTACGTG
TGGAAGAAGG GGCAGGCGCT GGTCCCGTCC TGGATCGCGT TCGCCGTCAT CGGCCTGCTC
GAGCAGCACT TCTCCCGGCT GGTGGACTAC AACTTCACCG CGGCGATGGA GGACGAGCTC
GACGGCATCG CCGACGGGCG GATCGGCCGC ACCGACTGGC TGTCCGCGTT CTACTTCGGC
GGCGACCTGG GCCCGGCCGG CTCGGTCGGC CGCTCCGGCG GCCTGAAGAA GCTGGTCGGC
GAGCGGCTGG AGGACATCGA CGCCCGCGAG GTCAACTCGT TGCCGTTGCT GACCGACGCC
GAGGGCCGGC AGGTGCTGGT CCGGGTCGGC CGGTACGGCC CGTACCTGGA GCGGATGGTG
CAGGGCGAGG ACGGCGAGCC GACCGCCCAG CGGGCCAACC TGCCCGAGGA CCTGCCCCCG
GACGAGGTCG ACGCCGAGGT CGCCGAGAAG CTGTTCAGCC AGTCCGGCGA CGGTGGCGAG
ACCGAGCTCG GGGTGGATCC GGACACCGGG CACCTGATCG TCGCCAAGGA CGGCCGGTTC
GGCCCCTACG TCACCGAGGT GCTGCCGGAG GCCGCTCCGG CGGCCACCGG GGCCGACGGG
ACCGCCAAGA AGACGACCAA GGCCAAGGCG GCGGCCAAGC CGCGTACCGC GTCGCTGTTC
AAGTCGATGA CCCTGGACAC CATCGACCTG CCCACCGCGC TACGGCTGCT GTCGCTGCCC
CGGGTGGTCG GCGTCGATCC GGCCGACGGC CAGGAGATCA CCGCGCAGAA CGGCCGGTAC
GGGCCCTACC TGAAGAAGGG CACCGACTCC CGGTCGCTGA CCAGCGAGGA CGCGCTGTTC
GACGTCACCC TGGACGAGGC GCTGGCCCTG TACGCGCAGC CCAAGACCCG CGGCCGGTCC
GCGGCGGCCG CGCCGCCGCT GCGGGAGGTC GGCATCGACC CGTCCGGCGG CAAACCGATG
GTGATCAAGG ACGGCCGGTT CGGGCCGTAC GTCACCGACG GGGAGACCAA CGCCTCCCTG
CGCAAGGGTG ACGAGGTCGA GACCCTCACC GTGGAGCGCG CGGCCGAGCT GCTGGCCGAT
CGACGCGCCC GCGGGCCGGC CCCCAAGCGG GCGACCACCC GCAAGCCGGC GGCGGCCAAG
GCCGGTGCCG CGGCCGGTGG CACCAAGACG GCCACCAAGA CTGCGGCGGC CAAGACGACG
GCCACCAAGA CCGCCACCAA GACTGCTTCC AAGACCACGG CGGCGGCGAA GGCGCGGACC
ACCAAGGCGG CCGGCACCAC CCGCAGCACC ACCCGTCGGA CCGGCCCCGC GGAGTGA
 
Protein sequence
MATGTSASTR NGGVRLVVVE SPSKAKTISG YLGDGYIVES SVGHIRDLPR GAADVPAKYK 
GEPWARLGVD TEHGFEPLYV VSPEKKAQVA KLKSLLAGAD ELYLATDEDR EGEAIAWHLL
ETLKPKVPVK RMVFHEITPA AIREAAANPR ALDENLVDAQ ETRRILDRLY GYEVSPVLWK
KVMPKLSAGR VQSVATRIIV QRERERMAFR SGTYWGLDAL MSPAGTGAEP FKSALSTVDG
RRLAAGRDFD PATGGLKADA DVLLLDEDGA RTLATALAGG TATVTSVEEK PYTRKPYPPF
MTSTLQQEAG RKLGFNSERT MRTAQRLYEN GFITYMRTDS TTLSSSAQEA ARAQARELYG
PEYVPPTPRQ YTRKVKNAQE AHEAIRPAGD NFRTPGQVAN QISGDEYRLY ELIWQRTIAS
QMVDARGLTL SVKIAATARE QECVFSASGR TITFPGFLRA YVETVDAEAG GEADDAERRL
PKLETGQKLD IRDLIPASHV TTPPARYTEP SLIGALQDLG IGRPSTYTSI IRTIIDRGYV
WKKGQALVPS WIAFAVIGLL EQHFSRLVDY NFTAAMEDEL DGIADGRIGR TDWLSAFYFG
GDLGPAGSVG RSGGLKKLVG ERLEDIDARE VNSLPLLTDA EGRQVLVRVG RYGPYLERMV
QGEDGEPTAQ RANLPEDLPP DEVDAEVAEK LFSQSGDGGE TELGVDPDTG HLIVAKDGRF
GPYVTEVLPE AAPAATGADG TAKKTTKAKA AAKPRTASLF KSMTLDTIDL PTALRLLSLP
RVVGVDPADG QEITAQNGRY GPYLKKGTDS RSLTSEDALF DVTLDEALAL YAQPKTRGRS
AAAAPPLREV GIDPSGGKPM VIKDGRFGPY VTDGETNASL RKGDEVETLT VERAAELLAD
RRARGPAPKR ATTRKPAAAK AGAAAGGTKT ATKTAAAKTT ATKTATKTAS KTTAAAKART
TKAAGTTRST TRRTGPAE