Gene Information |
Locus tag | Nham_2437 |
Symbol | |
ID | 4029271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | + |
Start bp | 2696986 |
End bp | 2699796 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637970896 |
Product | DNA topoisomerase I |
Protein accession | YP_577687 |
Protein GI | 92117958 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
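The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 2696986
END_BP = 2699796


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 2811 936
```

Under those assumptions the computed values agree with the Gene Length (2811 bp) and Protein Length (936 aa) fields.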
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0547767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGCCC CGAAACCGTC CGCGCATCTC TCGCGGAACC GGCCTCGAGT TTTCCCATTA AGCTATTGGA ATAACATGAA TATCGTCATT GTTGAGTCGC CTGCAAAGGC CAAGACGATC AACAAGTATC TAGGCGCTTC CTACGAGGTG CTGGCTTCGT TCGGCCACGT CCGCGACCTC CCCGCTAAAA ACGGATCGGT CGATCCGGAT GCCAATTTCC AGATGATCTG GGAGGTCGAT CCCAAAGCCG TCGGCCGACT CAACGACATC GCCAAATCGC TCAAGGGCGC CGACAAGCTC ATCCTCGCCA CCGACCCCGA TCGCGAGGGC GAGGCGATCT CCTGGCATGT GCTCGAGGTG CTGAAGCAGA AGCGCGCGCT GAAGGACCAG AAGATCGAGC GCGTCGTCTT CAACGCCATC ACCAAGCAGG CCGTCACCGA CGCGATGAAG AATCCTCGCC AGATTGACGG CGCGCTGGTC GACGCCTACA TGGCGCGCCG TGCGCTCGAC TATCTGGTCG GCTTCACCCT CTCCCCGGTG TTGTGGCGCA AGCTGCCCGG CGCCCGCTCG GCCGGGCGCG TGCAGTCGGT CGCCTTGCGG CTGGTCTGCA GCCGCGAGAT CGAGATCGAG AAATTCGTCC CGCGCGAATA CTGGTCGCTG ATCGCGACCC TGGCGACGCC GCGCGGCGAC ACCTTCGAAG CTCGCCTGGT CGGCGCCGAC GGCAAGAAAA TTCAGCGGCT CGATATCGGC AGCGGCGCCG AGGCCGAGGA TTTCAGGAAG GCGATCGAGG CCGCCAATTT CATCGTCTCG ACCGTCGAGG CGAAACCGGC GCGCCGCAAT CCGCAGGCGC CATTCACCAC TTCGACGCTG CAGCAGGAGG CGAGCCGCAA GCTCGGCTTC GCACCCGCGC ATACGATGCG GATCGCGCAA CGACTCTATG AAGGCATCGA CATCGGCGGC GAGACCACCG GCCTCATCAC TTACATGCGA ACCGACGGCG TGCAGATCGA CGGCTCGGCG ATCACGCAAG CGCGCAAGGT GATCGGCGAG GATTATGGCA ACGCCTACGT GCCGGACGCA CCGCGCCAGT ATCAGACCAA GGCCAAGAAC GCGCAGGAAG CCCATGAGGC GATCCGCCCG ACCGACCTGT CGCGCCGTCC CGCGGACATG AAGCGGCGTC TCGATGCCGA TCAGGCGAGA CTCTATGAGC TGATCTGGAT CCGGACCATC GCCAGCCAGA TGGAATCGGC GGAACTGGAA CGCACCACCG TCGACATCGC GGCCAGGGCC GGATCGCGCG TGCTGGAACT GCGCGCCAGC GGCCAGGTCA TCAAGTTCGA CGGCTTCCTG GCGCTATATC AGGAAGGCCG CGACGACGAG GAAGACGAGG ACAGCCGTCG TCTTCCCGCC ATGAGCGACG GCGAAGCGTT GAGGCGCGAG AACCTCAGCG TCACCCAGCA TTTCACCGAA CCGCCACCGC GCTTCTCGGA AGCCTCGCTG GTCAAGCGCA TGGAGGAGCT CGGCATCGGC CGACCCTCGA CCTATGCCTC GATCCTGCAG GTGCTCAAGG ATCGCGGCTA CGTCAAACTC GAGAAGAAAC GGCTCTATGG CGAAGACAAG GGCCGCGTCG TCGTCGCGTT CCTGGAGAAT TTCTTTGCGC GCTACGTCGA ATACGACTTC ACCGCCAACC TCGAGGAGCA GCTCGATCGC ATCTCGAATA ACGAGGTGTC GTGGCAGCAG GTCCTGAAGG ATTTCTGGAA CGACTTCATC GGCGTCGTCA ATGAGATCAA GGACGTCCGC 
GTTTCCGAAG TGCTCGACGT GCTCGACAAC ATGCTCGGGC CGCACATCTA TGCGCCGCGC GAGGATGGCG GCGACCCCCG CCAGTGCCCC ACCTGCGGGA CCGGAAAACT GAACCTCAAG GCCGGAAAGT TCGGCGCGTT CGTCGGCTGC TCCAACTACC CGGAATGCCG CTACACCCGT CCCCTCGCCG CCGATGGCGA GGCCGGCGGC GACCGCATTC TCGGCAAAGA TCCGGAGAGC GGTCTCGATG TAACGGTGAA GTCCGGCCGC TTCGGACCCT ACATCCAGCT CGGCGAGCCC AGCGATTATG GCGAGGGCGA AAAGCCGAAA CGCGCGGGCA TTCCGAAGAA CACGTCGCCC GCTGACATGG AGCTGGACCT CGCCGTGAAG CTCCTTTCGC TGCCGCGTGA GATCGGTAAG CATCCGGAGA CCGGCGAGCC GATCACCGCC GGCATAGGCC GCTTCGGTCC GTTCGTACGG CATGAAAAAA CCTATGCGAG TCTTGAAGCC GGCGACGAGG TGTTCGACAT CGGCCTCAAC CGCGCGGTGA CGCTGATTGC AGAGAAAATC GCAAAGGGGC CGAGCCGGCG CTTTGGCGCC GATCCCGGCA AGCCGCTCGG CGAGCATCCG ACACTCGGTT CCGTCGCGGT GAAGAGCGGA CGTTATGGCG CCTATGTCAC GGCAGGCGGC GTCAACGCGA CCATCCCGAG CGACAAGACC CAGGACACCA TCACCCTGCC CGAGGCCATC GCGCTGATCG ACGAACGCGC CGCCAAGGGC GGCGGCAAGC CGAAGCGCGC GGCAAAGAAG GCCAAGTCCG CCGGGACTGA AAGGACCGGC AAGGCCGGCG CCGTGGCAGC AAAGCCGGCC AAAAAGGCGG CTGCAAAACA GGCCTCCGTC AAGCCGAAGT CCGCCGCGGT GAGCAAGGCG CGCGGCTCGA TCGCGGCAGT CAAACCGCCG GCCAAGGCTG AAAAAGCGCC CGCAAAGAAG AGTGCTGGCA AGAACGGCTG A
|
Protein sequence | MFAPKPSAHL SRNRPRVFPL SYWNNMNIVI VESPAKAKTI NKYLGASYEV LASFGHVRDL PAKNGSVDPD ANFQMIWEVD PKAVGRLNDI AKSLKGADKL ILATDPDREG EAISWHVLEV LKQKRALKDQ KIERVVFNAI TKQAVTDAMK NPRQIDGALV DAYMARRALD YLVGFTLSPV LWRKLPGARS AGRVQSVALR LVCSREIEIE KFVPREYWSL IATLATPRGD TFEARLVGAD GKKIQRLDIG SGAEAEDFRK AIEAANFIVS TVEAKPARRN PQAPFTTSTL QQEASRKLGF APAHTMRIAQ RLYEGIDIGG ETTGLITYMR TDGVQIDGSA ITQARKVIGE DYGNAYVPDA PRQYQTKAKN AQEAHEAIRP TDLSRRPADM KRRLDADQAR LYELIWIRTI ASQMESAELE RTTVDIAARA GSRVLELRAS GQVIKFDGFL ALYQEGRDDE EDEDSRRLPA MSDGEALRRE NLSVTQHFTE PPPRFSEASL VKRMEELGIG RPSTYASILQ VLKDRGYVKL EKKRLYGEDK GRVVVAFLEN FFARYVEYDF TANLEEQLDR ISNNEVSWQQ VLKDFWNDFI GVVNEIKDVR VSEVLDVLDN MLGPHIYAPR EDGGDPRQCP TCGTGKLNLK AGKFGAFVGC SNYPECRYTR PLAADGEAGG DRILGKDPES GLDVTVKSGR FGPYIQLGEP SDYGEGEKPK RAGIPKNTSP ADMELDLAVK LLSLPREIGK HPETGEPITA GIGRFGPFVR HEKTYASLEA GDEVFDIGLN RAVTLIAEKI AKGPSRRFGA DPGKPLGEHP TLGSVAVKSG RYGAYVTAGG VNATIPSDKT QDTITLPEAI ALIDERAAKG GGKPKRAAKK AKSAGTERTG KAGAVAAKPA KKAAAKQASV KPKSAAVSKA RGSIAAVKPP AKAEKAPAKK SAGKNG
|
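The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. The sketch below is a minimal illustration, not the database's own method. It builds the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs mainly in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The function name `translate` is an assumption made for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record
    can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGTTCGCCC CGAAACCGTC CGCGCATCTC"))  # prints: MFAPKPSAHL
```

The first ten residues match the start of the listed protein sequence. The same function applied to the last twelve bases (`AAGAACGGCTGA`) gives `KNG`, which matches the end of the protein, with `TGA` read as the stop codon.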