Gene Information |
Locus tag | Tcur_4545 |
Symbol | |
ID | 8605906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 5151414 |
End bp | 5154194 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | DNA topoisomerase I |
Protein accession | YP_003302110 |
Protein GI | 269128740 |
COG category | |
COG ID | |
TIGRFAM ID | |
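The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 5151414
end_bp = 5154194
gene_length_bp = 2781
protein_length_aa = 926


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


print(span_length(start_bp, end_bp))            # 2781
print(expected_protein_length(gene_length_bp))  # 926
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the inclusive-coordinate assumption.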
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCAGCCA AGAACGGCAC AGCACAGCGG GGCCGGGGTC GGGGCCGGGC CGCGGCGTCC AGTGGCACGG GCACACGCCT GGTGATCGTC GAGTCGCCCG CCAAGGCCAA GACGATCGCC GGTTACCTGG GCAGCGGCTA TGTCGTGGAG TCCAGTATCG GCCACATCCG CGACATCCCG CGGCCCGCCG ACATGCCCGA GGAGATCAAG AACGAGCCTT GGGCCAAGCT CGGCGTGAAC GTCGACAAGG ACTTCGAGCC GTACTACGAG GTCACCCCGG ACAAAAAGAG CCAGGTCGCC AAGCTGCGCA AGGCGCTGAA GGAGGCCGAC GAGCTCTACC TGGCCACCGA CGAGGACCGC GAGGGGGAGG CGATCGCCTG GCACCTGCGC GAGGTGCTCA AGCCCAAGAT CCCGGTGCAC CGCATGGTGT TCAACGAGAT CACCCCCGAG GCGATCCGGC ACGCCGCCGC CAACCCCCGC GAGCTGGACC TGAAGCTGGT GGACGCCCAG GAGACCCGGC GCATCCTGGA CCGCCTCTAC GGCTTCGAAG TCAGCCCGGT GCTGTGGCGC AAGATCATGC AGGGCCTGTC GGCCGGCCGG GTGCAGTCGG TGGCCACCCG CCTGGTGGTG GAGCGCGAAC GCGAGCGCAT CGCGTTCGTC CCCGCCCACT ACTGGGACAT CGAGGCCGAC TTCGCCGTCG CCGCCGCCGA CGAGGAGGGC GGCCTCCGCT CGTTCGTCGC GCACCTGGTC GGGGTGGACG GCAAGCGCAT CGCCCAGGGC CGTGACTTCT CCTCCCGGGG CGAGCTGAAG TCCGCCGAGC TGCTGCACCT GGACGAGGAG GCGGCGCGCG GCCTGGCCGA GCGGCTGTCC GGGCGTCCCT TCACGGTCAC CTCCGTCGAG CGCAAGCCCT ACACCCGCAA GCCGTACGCG CCGTTCCGCA CCACCACCCT GCAGCAGGAG GCCAGCCGCA AGCTGGGCTT CTCGGCCAAG TACACGATGC AGGTGGCGCA GCGGCTGTAT GAGAACGGCT TCATCACCTA CATGCGGACC GACAGCACCA ACCTGTCGGA GACCGCCCTG GCCGCCGCCC GCGCCCAGGC CACCGCGCTG TACGGCCCCG AGTACGTCGC CGACAAGCCC CGCATCTACG CCACGAAGGT CAAGAACGCC CAGGAGGCGC ACGAGGCGAT CCGCCCCGCC GGGGACGTCT TCCGCACCCC GGCCGAGACC GGGCTGACCG GCGACCAGTT CCGGCTGTAT GAGCTGATCT GGAAACGCAC CATCGCCTCC CAGATGAAGG ACGCCGTCGG CCAGTCGGTG TCGGTGCGGG TGGAGGGCTC CTCCAGCGCC GGCGAGCGGG CCGAGTTCGC CGCCACCGGC AAGACCATCA CCTTCTACGG CTTCCTGAAG GCGTATGTGG AGGGCGCCGA CGACCCCGAG TCCGACGCCG ACGACAGCGA GCGGCGCCTG CCGGCCCTGG CCGAGGGCGA CCCGCTGAGC GCCGAGCGGC TGCAGGCCCA GGGGCACAGC ACCCGCCCGC CCGCCCGCTA CACCGAGGCC ACCTTGGTCA AGGAGCTGGA GGAGCGGGAG ATCGGCCGGC CCTCCACCTA CGCCACGATC CTGGGGACGA TCCTCGAACG CGGATACGTG TTCAAAAAGG GCACCGCGCT GGTGCCGTCC TTCCTGGCGT TCGCCGTGGT CAACCTGCTG GAGAAGCACT TCGGGCACCT GGTCGACTAC GACTTCACCG CCAAGATGGA GGACATCCTC GACCTGATCG CCCGGGGCGA GGCCGAGCGG 
GTGCCGTGGC TGCGCCGCTT CTACTTCGGC GACGGCAACG GCGAGGGCGA GGTCGGCCTG AAGGAGCTGG TCACCGATCT GGGCGAGATC GACGCCATCG GGGTCAGCAC GTTCCCGATC CCCGGCAGCG ACATCACCGC CCGGGTCGGC CGGTACGGCC CGTACCTGGT GCGCACCAAG CCGGACGGCA CCGAAGAGCG CGTCAACATC CCCGCCGACC TGGCGCCCGA TGAGCTGACC GCCGAGAAGG CCGAGGAGCT GTTCGCCCAG CCCAGCGGCG ACCGCGAACT GGGCCGCGAC CCCGAGACCG GGCATGCGAT CGTGCTCAAG ACCGGCCGGT TCGGCCCGTA TGTGACCGAG GTGCTGCCCG AGGAGCTGAC CACTACCGCC TCCGGGCGCA AGAAGAAGGA CGCGCCCAAG CCGCGCACCG CCTCGCTGCT GAAGTCGATG CGGCCGGAGA CGGTCACCCT CGAGGACGCG CTGAAGCTGC TGTCGCTGCC GCGCACCCTC GGGGAACTGG ACGGCGAGCC GGTGACCGTG CACAACGGCC GCTTCGGCCC GTACGTCAAA AAGGGCTCCG ACAGCCGCTC CCTGGCCTCC GACGAGGAGC TGTTCACCCT CACCCTGGAG CAGGCCCGGG AGCTGTTCGC CCAGCCCAAG CAGCGCGGCC GGGGCCGGAG TGCGGCATCG GCGCCGCTGC GCGAGCTGGG GGAGGACCCG GCCACCGGCA AGCCGATCGT GGTCAAGGAG GGCCGCTTCG GTCCCTACGT GACCGACGGG GAGACCAACG CCAGCCTCCG CAAGGGGGAT GAGGTGGAGT CCATCACCGT CCAGCGTGCG GCCGAGCTGC TGGCCGAGCG GCGCGAGAAG GTCGCCGCGG GCGGCGGGCG CAAGACCGCC ACGCGGCGCG CCTCGGCCAA GAAGACCGCG ACCGCCAAGA AGACCACGGC CAAGAAGACA TCCGCTCGCT CCGCGACCTG A
Protein sequence | MPAKNGTAQR GRGRGRAAAS SGTGTRLVIV ESPAKAKTIA GYLGSGYVVE SSIGHIRDIP RPADMPEEIK NEPWAKLGVN VDKDFEPYYE VTPDKKSQVA KLRKALKEAD ELYLATDEDR EGEAIAWHLR EVLKPKIPVH RMVFNEITPE AIRHAAANPR ELDLKLVDAQ ETRRILDRLY GFEVSPVLWR KIMQGLSAGR VQSVATRLVV ERERERIAFV PAHYWDIEAD FAVAAADEEG GLRSFVAHLV GVDGKRIAQG RDFSSRGELK SAELLHLDEE AARGLAERLS GRPFTVTSVE RKPYTRKPYA PFRTTTLQQE ASRKLGFSAK YTMQVAQRLY ENGFITYMRT DSTNLSETAL AAARAQATAL YGPEYVADKP RIYATKVKNA QEAHEAIRPA GDVFRTPAET GLTGDQFRLY ELIWKRTIAS QMKDAVGQSV SVRVEGSSSA GERAEFAATG KTITFYGFLK AYVEGADDPE SDADDSERRL PALAEGDPLS AERLQAQGHS TRPPARYTEA TLVKELEERE IGRPSTYATI LGTILERGYV FKKGTALVPS FLAFAVVNLL EKHFGHLVDY DFTAKMEDIL DLIARGEAER VPWLRRFYFG DGNGEGEVGL KELVTDLGEI DAIGVSTFPI PGSDITARVG RYGPYLVRTK PDGTEERVNI PADLAPDELT AEKAEELFAQ PSGDRELGRD PETGHAIVLK TGRFGPYVTE VLPEELTTTA SGRKKKDAPK PRTASLLKSM RPETVTLEDA LKLLSLPRTL GELDGEPVTV HNGRFGPYVK KGSDSRSLAS DEELFTLTLE QARELFAQPK QRGRGRSAAS APLRELGEDP ATGKPIVVKE GRFGPYVTDG ETNASLRKGD EVESITVQRA AELLAERREK VAAGGGRKTA TRRASAKKTA TAKKTTAKKT SARSAT
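The relationship between the gene sequence and the protein sequence can be illustrated on the first 30 bases of the record. This is a minimal sketch, assuming translation table 11 (as listed above), in which codon-to-amino-acid assignments match the standard code and an alternative initiation codon such as GTG is read as methionine at the start of the CDS. The helper names are hypothetical, and only the first ten codons are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino acid
# assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for table 11; at position 1 they encode Met.
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS given in coding orientation, stopping at a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in TABLE_11_STARTS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "GTGCCAGCCA AGAACGGCAC AGCACAGCGG"
print(translate_cds(prefix))  # MPAKNGTAQR
```

The output matches the first ten residues of the protein sequence above, including the leading M that comes from the GTG start codon.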