Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1803 |
Symbol | |
ID | 3918362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1899884 |
End bp | 1902463 |
Gene Length | 2580 bp |
Protein Length | 859 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640444544 |
Product | DNA topoisomerase I |
Protein accession | YP_497077 |
Protein GI | 87199820 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.120095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCTTG TCATCGTAGA ATCCCCCGCC AAGGCCAAGA CCATCGAGAA GTACCTCGGA CCTGGCTACA AGGTGCTGGC CTCCTACGGC CACGTCCGCG ATCTCCCCGT GAAGGATGGG TCCGTCCGCC CGGACGAGGA TTTCGCCATG GATTGGGAGC TTTACGGGGA CAAGCAGGCC CGCGTGAAGG CGATCTCCGA TGCGGCCAAG GGCGCGGACC GACTGATCCT CGCGACCGAC CCTGACCGCG AGGGCGAAGC CATTTCGTGG CACGTCAGGG AACTGCTGGC GAAACGACGC GTGCTTCCGA AGGAAGTCGA ACGCGTTACT TTCAACGCCA TCACCAAGCA GACCGTGACC GACGCGATGA AGAAGCCGCG TGCGCTCGAC CAGGACCTGA TCGACGCGTA CCTGGCTCGC CGCGCGCTGG ACTATCTGTT TGGGTTCACG CTTTCGCCGG TCCTGTGGCG CAAGCTGCCC GGCGCCAAGT CGGCCGGGCG CGTCCAGTCG GTCGCCCTGC GCCTCATCGT CGAGCGCGAG CGCGAGATCG AGGCCTTCCG CGCGCAGGAA TACTGGTCCG TCATCGCCCG GCTCGAACAT GCCGGCACCG AGTTTGCGGC GCGGCTCGTA AAGTTCGACG GGCAGAAGCT CGACAAGCTG ACGCTGGGCG ATGAAGGCGC GGCGATGAGG GCCAAGGGCG TGGTCGAGGC CGCGACTTTC CGCGTCGAGG AAGTCGAGAC AAAGCCCACG CGCCGCAACC CCTATCCGCC GTTCACCACA TCGACCCTGC AACAGGAAGC CGCGCGCAAG CTCGGCTTCT CGGCCAGCCA CACCATGCGC GTGGCGCAGA CGCTCTACGA AGCGGGCGCG ATCACCTACA TGCGTACCGA TGGCGTGCAG ATGGACCCGA GCGCCATTTC GGCCGCGCGC AAGGCCATCT CGGATCGTTA CGACGGCCAC TACCTGCCCG AAAAGCCCCG CCATTACGAA ACTAAGGCCA AGAACGCGCA GGAAGCCCAC GAGGCGATCC GGCCCACGGA TTTCTCGAAG GACCGTTACG GGTCCGCCGA CGAGGCGCGG CTCTACGATC TGATCTACAA GCGTGCGATG GCCAGCCAGA TGGCGTCGGC CAATATCGAG CGGACGACCG TTACGTTGCG CGACGGCACC GGAAAGCACG AACTGCGCGC CACTGGCCAG GTGGTGAAGT TCCCCGGCTT CCTCGCCGTC TATGAGGAAG GTCGCGACCA GAAGCAGGAT GGCGACGAAG AGGATGGCAG CCTGCTTCCC GCGATGGCAT CGGGCGATAC CCCGGCCAAG CGCGGGGTCG ATGCCACCCA GCACTTCACC CAGCCGCCGC CGCGCTATTC GGAAGCGAGC CTCGTCAAGC GGCTCGAGGA ACTCGGCATC GGCCGTCCTT CGACCTACGC TGCAACCCTT CAGGTGTTGA AGGACCGCAA CTACGTCAGG ACCGAGAAAA ACCGTTTCTT CGCAGAGGAA TCCGGACGAC TTCTCACAGC ATTTCTCGAG CGATTCTTCC CTCGCTACGT GGCTTACGAG TTCACGGCGG GCATGGAGGA GGAGCTGGAC GACGTGTCCG GTGGACGCGC CGAATGGAAG AAGGTGCTTG CCGACTTCTG GCGCGACTTC AAGCCCAAGT CCGACGAGGT GATGGAGAAG AAGCCGAGCG AGGTTACCGC CGAGCTTGAC GAGTTTCTCT CCGACTACCT CTTCCCCCCG CGCGCCGATG GCACCGACCC GCGCCTCTGC CCCAAGTGCG AAGAGGGCCG TCTGTCGCTG CGCGGCGGGC GTTACGGCGC TTTCGTGGCC TGCTCGAACT ATCCCGAATG CAAGTATACC CGCAAGTTCG CGCAGCCTGG CGGAGCCGAT GGCGACGGCG GCGAGGATGG TGCGCTGGGC AAGGATCCAG AGACGGGTCT GGAAGTCGTG CGCAAGGCAG GCCGATTCGG CCCCTACATC CAGCTTGGTG ACGGCAAGGA GGCCAAGCGC GCCTCTATCC CCAAGGACAT TGGCGAACTC GATCTGGAAT GGGCGCTCAA GCTGCTGTCA CTGCCGCGTG AAGTGGGCGT GCATCCGGAA ACCGGAAACC CGATCACCGC CAGCATCGGG CGTTATGGCC CCTATCTGGC GCACGACGGC AAGTATGCGC GGCTCAAGTC CACTGCCGAG GTGTTCGAGA CGGGCATGAA CGCGGCGGTC ATGCTGCTGG CTGCTGCAGC GGCCGGGGCG GGCGCGCGCG GCAGCCGTGC CGCGGCCGAA CCCCTCAAGG TCTTCGGCGC GCATCCCACT TCCGGGGGTG AGATCAAGCT GATGGCAGGG CGCTATGGCC CTTACGTCAC CGACGGCACC ACCAACGCCA CCCTCCCGCG CGACAAGCAG CCCGAGGCGC TGACGCTGGA AGAGGCGATC ACGCTCATCG ACGAACGTGC CGCCAAAGGC CCGGCCAAGG GCAAGAAGAA GGCCCCTGCC AAGAAGGCTT CGGCCGCGAA AAAGCCTCCC GCCGCGAAGA AGGCTCCGGC CAAGAAGGCG GCACCGAAAA AGGCCGCTGC TGCGGAATGA
|
Protein sequence | MQLVIVESPA KAKTIEKYLG PGYKVLASYG HVRDLPVKDG SVRPDEDFAM DWELYGDKQA RVKAISDAAK GADRLILATD PDREGEAISW HVRELLAKRR VLPKEVERVT FNAITKQTVT DAMKKPRALD QDLIDAYLAR RALDYLFGFT LSPVLWRKLP GAKSAGRVQS VALRLIVERE REIEAFRAQE YWSVIARLEH AGTEFAARLV KFDGQKLDKL TLGDEGAAMR AKGVVEAATF RVEEVETKPT RRNPYPPFTT STLQQEAARK LGFSASHTMR VAQTLYEAGA ITYMRTDGVQ MDPSAISAAR KAISDRYDGH YLPEKPRHYE TKAKNAQEAH EAIRPTDFSK DRYGSADEAR LYDLIYKRAM ASQMASANIE RTTVTLRDGT GKHELRATGQ VVKFPGFLAV YEEGRDQKQD GDEEDGSLLP AMASGDTPAK RGVDATQHFT QPPPRYSEAS LVKRLEELGI GRPSTYAATL QVLKDRNYVR TEKNRFFAEE SGRLLTAFLE RFFPRYVAYE FTAGMEEELD DVSGGRAEWK KVLADFWRDF KPKSDEVMEK KPSEVTAELD EFLSDYLFPP RADGTDPRLC PKCEEGRLSL RGGRYGAFVA CSNYPECKYT RKFAQPGGAD GDGGEDGALG KDPETGLEVV RKAGRFGPYI QLGDGKEAKR ASIPKDIGEL DLEWALKLLS LPREVGVHPE TGNPITASIG RYGPYLAHDG KYARLKSTAE VFETGMNAAV MLLAAAAAGA GARGSRAAAE PLKVFGAHPT SGGEIKLMAG RYGPYVTDGT TNATLPRDKQ PEALTLEEAI TLIDERAAKG PAKGKKKAPA KKASAAKKPP AAKKAPAKKA APKKAAAAE
|
| |