Gene Information |
Locus tag | Hneap_2090 |
Symbol | |
ID | 8535249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 2237151 |
End bp | 2239655 |
Gene Length | 2505 bp |
Protein Length | 834 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 646384468 |
Product | DNA topoisomerase I |
Protein accession | YP_003263955 |
Protein GI | 261856672 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
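
The coordinate, gene length, and protein length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(2237151, 2239655)
residues = protein_length_aa(length)
print(length, residues)
```

With the values from this record, the computed lengths agree with the Gene Length (2505 bp) and Protein Length (834 aa) rows.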
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.475207 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAAA CGCTTGTTAT CGTTGAGTCG CCCGCCAAAG CGAAAACCAT TAAGAAATAC CTCGGCCCCG GCTACGAGGT GCTCGCCTCC TACGGCCATG TGCGCGACCT GATCCCCAAA GATGGCGCCG TGGATACCGC GCACGACTTT GCGATGAACT ACACCCTGAT CGACAAGAAC GTGCGCCACG TCGATGCCAT TAAAAAAGCG CTTAAATCAT CCGACATTCT CCTGCTCGCA ACCGACCCCG ATCGCGAAGG CGAAGCCATT TCCTGGCATT TGCGCGAACT CTTGGCGGAA GCAGGCTTGC TGAAGAACAA GACGGCGCAA CGGGTGGTGT TCTACGAAAT CACCAAAAAA GCAGTACAGG AAGCGGTCGC GCATCCACGC GATCTTTCCA TCGATCTGAT CAATGCCCAG CAGGCACGCC GCGCACTCGA TTACCTCGTG GGCTTCAACT TGTCGCCGCT CTTGTGGAAA AAGATCAATC CGGGGCTGTC CGCCGGACGG GTGCAAAGCC CCGCCCTGCG CCTGATCGTG GAGCGCGAGG CCGAGATCGA AGCGTTCAAT CCGCAGGAAT ACTGGACCAT ACTCGCCGAC TGCCAGGCCG ATCGGGCTGC CGAAAAGAGC CGATTCAATG CACGCCTGCT GACACTGGAC GGGCAAAAGG CCGAACAATT CACGCTGACC AATGAAACGG ATGCGCAATC TGCGCGCAGC CGTATCCTCG AAGCCGCCGC CGGCACACTC ACCGTCCGTT CGGTGGAAAA GCGCGAACGC AAGCGCAATC CGGCCCCTCC GTTTACCACC TCGACGCTCC AGCAGGAAGG GGTTCGTAAA CTCGGGCTCT CCGCCTCGCG CGTCATGCGC CTGGCGCAGG AGTTGTACGA AGGCGTCGAC ATCGGCGCGG GCACCGTGGG TTTGATTACC TACATGCGAA CCGATGCCGT GACCCTCTCC GAGGACGCGC TCACGCAGAT TCGGGCACAT ATCGGCGATA AGTACGGTGC GGCCTACCTT CCCGCCAGCC CGAACCGCTA CAAGACCAAA TCCAAAAACG CACAAGAAGC GCACGAGGCC ATCCGTCCGA CATCAGCGGC ACATACCCCG GATTCGGTAC GCGCTTTCCT GAACAAGGAT CAGTTCCGCC TGTATGAGAT GATTTTCAAG CGCGCGGTCG CTTCGCAAAT GACACCGGCA GTCTACGATC AAGTTTCCGT TGATCTCGCC GTCAATGATC AGCATAGCTT TCGCGCCAAT GGCTCTACCC TCAAATTCCC GGGCTTCATC GCGCTTTATC GCGAGGATGA AGACGATGCC TCAGGGGACA ATGACGAAGA TCGCCGCCTG CCGCCGCTGA CCGTAGGCGA CAAGATTGCC CTAAACGACA TCGCTGCCGA CCAGCATTTT ACCGAGCCAC CGCCGCGCTT TACCGAGGCG AGTCTGGTCA AGACGCTGGA AGAGTATGGT ATCGGTCGCC CTTCGACCTA TGCCAGCATC ATCTCCACCC TCCAGGCGCG TGAGTACGTT CTGCTCGACC AACGGCGCTT CAAACCCACC GATATGGGCC GGGTCGTTAA TGGCTTTCTG ACCGACTACT TCCGCGATAT TGTCGATTAC GAATTCACCG CCAAACTGGA AGATGATCTG GATGCCGTGT CCCGCGGCGA ACGCGACTGG GTGCCATTGA TGCGCGAGTT CTGGACGCCA TTCCATGACC GTGTCGAGCA CACCAATGAG AATGTCACCC GGCAGGAGGC CGCCCAGGGG CGTGAACTGG GGATCGATCC CAAATCGGGC 
AAGCCCGTCT CCGTCCGACT CGGGCGGTTC GGTCCCTTCG CCCAGATCGG CACCAAGGAC GACGAGGAAA AACCAAAGTT CGCCTCGCTC AAGCGCAGCC AGAGCATTGC CACCATCACG CTGGATGAAG CCCTGGACCT GTTTCAGTTG CCGAGAAAAC TGGGCGAAAC CCCGGAAGGC GAACCTGTCG AGGTCGCCAT TGGTCGGTTC GGCCCCTTTG TAAAGTTCGG CAAAATGTAC GCTTCGCTTG GCAAGGATGA CGATCCGTAC ACCATCGAAC TGCCGCGCGC GCTCGAAATC ATCGAGATCA AAAAGCTCGC TGAGAAGAAT CGCTACATCA CCCAGTTCGA TAATGGGGTG TCGGTACAAA ACGGTCGCTA TGGCCCCTAC ATCACCGATG GCAAAAAGAA CGCCAAAATC CCCAAGGACA AAGATCCAAA ATCCCTGACA CTGGAAGAGT GCGTCGCCTT GCTTGCAGCT GCCCCTGAGA AGAAATCGGC GCGCGGCAAA ACCGCTGCGA AAGCGACGGC CAAGAAGACG GCAGCCCCAA AAGCGACGGC TCCGAAAACA ACAGCCAAAA AGTCAACGAC GCCGCGCGCC AAAAAGGCCA GCGCGACCGA TGCACAAGCG CCTGCAAAGA AGACGCCGGT AAAAAAGCCC GCCACCGCTC GCAGCAAGAA ACCCGCCGCG ACGATACCGG AATAA
|
Protein sequence | MSQTLVIVES PAKAKTIKKY LGPGYEVLAS YGHVRDLIPK DGAVDTAHDF AMNYTLIDKN VRHVDAIKKA LKSSDILLLA TDPDREGEAI SWHLRELLAE AGLLKNKTAQ RVVFYEITKK AVQEAVAHPR DLSIDLINAQ QARRALDYLV GFNLSPLLWK KINPGLSAGR VQSPALRLIV EREAEIEAFN PQEYWTILAD CQADRAAEKS RFNARLLTLD GQKAEQFTLT NETDAQSARS RILEAAAGTL TVRSVEKRER KRNPAPPFTT STLQQEGVRK LGLSASRVMR LAQELYEGVD IGAGTVGLIT YMRTDAVTLS EDALTQIRAH IGDKYGAAYL PASPNRYKTK SKNAQEAHEA IRPTSAAHTP DSVRAFLNKD QFRLYEMIFK RAVASQMTPA VYDQVSVDLA VNDQHSFRAN GSTLKFPGFI ALYREDEDDA SGDNDEDRRL PPLTVGDKIA LNDIAADQHF TEPPPRFTEA SLVKTLEEYG IGRPSTYASI ISTLQAREYV LLDQRRFKPT DMGRVVNGFL TDYFRDIVDY EFTAKLEDDL DAVSRGERDW VPLMREFWTP FHDRVEHTNE NVTRQEAAQG RELGIDPKSG KPVSVRLGRF GPFAQIGTKD DEEKPKFASL KRSQSIATIT LDEALDLFQL PRKLGETPEG EPVEVAIGRF GPFVKFGKMY ASLGKDDDPY TIELPRALEI IEIKKLAEKN RYITQFDNGV SVQNGRYGPY ITDGKKNAKI PKDKDPKSLT LEECVALLAA APEKKSARGK TAAKATAKKT AAPKATAPKT TAKKSTTPRA KKASATDAQA PAKKTPVKKP ATARSKKPAA TIPE
|
| |
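
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any calculation such as the GC content reported in the Gene Information table. The following is a minimal sketch under that assumption. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short string used at the end is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used only as sample input.
sample = clean_sequence("ATGAGCCAAA CGCTTGTTAT")
print(len(sample), gc_percent(sample))
```

Passing the full gene sequence through `clean_sequence` first should give a string of the listed gene length, and its GC percentage can then be compared with the GC content row.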