Gene Saro_1803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1803 
Symbol 
ID3918362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1899884 
End bp1902463 
Gene Length2580 bp 
Protein Length859 aa 
Translation table11 
GC content65% 
IMG OID640444544 
ProductDNA topoisomerase I 
Protein accessionYP_497077 
Protein GI87199820 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.120095 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCTTG TCATCGTAGA ATCCCCCGCC AAGGCCAAGA CCATCGAGAA GTACCTCGGA 
CCTGGCTACA AGGTGCTGGC CTCCTACGGC CACGTCCGCG ATCTCCCCGT GAAGGATGGG
TCCGTCCGCC CGGACGAGGA TTTCGCCATG GATTGGGAGC TTTACGGGGA CAAGCAGGCC
CGCGTGAAGG CGATCTCCGA TGCGGCCAAG GGCGCGGACC GACTGATCCT CGCGACCGAC
CCTGACCGCG AGGGCGAAGC CATTTCGTGG CACGTCAGGG AACTGCTGGC GAAACGACGC
GTGCTTCCGA AGGAAGTCGA ACGCGTTACT TTCAACGCCA TCACCAAGCA GACCGTGACC
GACGCGATGA AGAAGCCGCG TGCGCTCGAC CAGGACCTGA TCGACGCGTA CCTGGCTCGC
CGCGCGCTGG ACTATCTGTT TGGGTTCACG CTTTCGCCGG TCCTGTGGCG CAAGCTGCCC
GGCGCCAAGT CGGCCGGGCG CGTCCAGTCG GTCGCCCTGC GCCTCATCGT CGAGCGCGAG
CGCGAGATCG AGGCCTTCCG CGCGCAGGAA TACTGGTCCG TCATCGCCCG GCTCGAACAT
GCCGGCACCG AGTTTGCGGC GCGGCTCGTA AAGTTCGACG GGCAGAAGCT CGACAAGCTG
ACGCTGGGCG ATGAAGGCGC GGCGATGAGG GCCAAGGGCG TGGTCGAGGC CGCGACTTTC
CGCGTCGAGG AAGTCGAGAC AAAGCCCACG CGCCGCAACC CCTATCCGCC GTTCACCACA
TCGACCCTGC AACAGGAAGC CGCGCGCAAG CTCGGCTTCT CGGCCAGCCA CACCATGCGC
GTGGCGCAGA CGCTCTACGA AGCGGGCGCG ATCACCTACA TGCGTACCGA TGGCGTGCAG
ATGGACCCGA GCGCCATTTC GGCCGCGCGC AAGGCCATCT CGGATCGTTA CGACGGCCAC
TACCTGCCCG AAAAGCCCCG CCATTACGAA ACTAAGGCCA AGAACGCGCA GGAAGCCCAC
GAGGCGATCC GGCCCACGGA TTTCTCGAAG GACCGTTACG GGTCCGCCGA CGAGGCGCGG
CTCTACGATC TGATCTACAA GCGTGCGATG GCCAGCCAGA TGGCGTCGGC CAATATCGAG
CGGACGACCG TTACGTTGCG CGACGGCACC GGAAAGCACG AACTGCGCGC CACTGGCCAG
GTGGTGAAGT TCCCCGGCTT CCTCGCCGTC TATGAGGAAG GTCGCGACCA GAAGCAGGAT
GGCGACGAAG AGGATGGCAG CCTGCTTCCC GCGATGGCAT CGGGCGATAC CCCGGCCAAG
CGCGGGGTCG ATGCCACCCA GCACTTCACC CAGCCGCCGC CGCGCTATTC GGAAGCGAGC
CTCGTCAAGC GGCTCGAGGA ACTCGGCATC GGCCGTCCTT CGACCTACGC TGCAACCCTT
CAGGTGTTGA AGGACCGCAA CTACGTCAGG ACCGAGAAAA ACCGTTTCTT CGCAGAGGAA
TCCGGACGAC TTCTCACAGC ATTTCTCGAG CGATTCTTCC CTCGCTACGT GGCTTACGAG
TTCACGGCGG GCATGGAGGA GGAGCTGGAC GACGTGTCCG GTGGACGCGC CGAATGGAAG
AAGGTGCTTG CCGACTTCTG GCGCGACTTC AAGCCCAAGT CCGACGAGGT GATGGAGAAG
AAGCCGAGCG AGGTTACCGC CGAGCTTGAC GAGTTTCTCT CCGACTACCT CTTCCCCCCG
CGCGCCGATG GCACCGACCC GCGCCTCTGC CCCAAGTGCG AAGAGGGCCG TCTGTCGCTG
CGCGGCGGGC GTTACGGCGC TTTCGTGGCC TGCTCGAACT ATCCCGAATG CAAGTATACC
CGCAAGTTCG CGCAGCCTGG CGGAGCCGAT GGCGACGGCG GCGAGGATGG TGCGCTGGGC
AAGGATCCAG AGACGGGTCT GGAAGTCGTG CGCAAGGCAG GCCGATTCGG CCCCTACATC
CAGCTTGGTG ACGGCAAGGA GGCCAAGCGC GCCTCTATCC CCAAGGACAT TGGCGAACTC
GATCTGGAAT GGGCGCTCAA GCTGCTGTCA CTGCCGCGTG AAGTGGGCGT GCATCCGGAA
ACCGGAAACC CGATCACCGC CAGCATCGGG CGTTATGGCC CCTATCTGGC GCACGACGGC
AAGTATGCGC GGCTCAAGTC CACTGCCGAG GTGTTCGAGA CGGGCATGAA CGCGGCGGTC
ATGCTGCTGG CTGCTGCAGC GGCCGGGGCG GGCGCGCGCG GCAGCCGTGC CGCGGCCGAA
CCCCTCAAGG TCTTCGGCGC GCATCCCACT TCCGGGGGTG AGATCAAGCT GATGGCAGGG
CGCTATGGCC CTTACGTCAC CGACGGCACC ACCAACGCCA CCCTCCCGCG CGACAAGCAG
CCCGAGGCGC TGACGCTGGA AGAGGCGATC ACGCTCATCG ACGAACGTGC CGCCAAAGGC
CCGGCCAAGG GCAAGAAGAA GGCCCCTGCC AAGAAGGCTT CGGCCGCGAA AAAGCCTCCC
GCCGCGAAGA AGGCTCCGGC CAAGAAGGCG GCACCGAAAA AGGCCGCTGC TGCGGAATGA
 
Protein sequence
MQLVIVESPA KAKTIEKYLG PGYKVLASYG HVRDLPVKDG SVRPDEDFAM DWELYGDKQA 
RVKAISDAAK GADRLILATD PDREGEAISW HVRELLAKRR VLPKEVERVT FNAITKQTVT
DAMKKPRALD QDLIDAYLAR RALDYLFGFT LSPVLWRKLP GAKSAGRVQS VALRLIVERE
REIEAFRAQE YWSVIARLEH AGTEFAARLV KFDGQKLDKL TLGDEGAAMR AKGVVEAATF
RVEEVETKPT RRNPYPPFTT STLQQEAARK LGFSASHTMR VAQTLYEAGA ITYMRTDGVQ
MDPSAISAAR KAISDRYDGH YLPEKPRHYE TKAKNAQEAH EAIRPTDFSK DRYGSADEAR
LYDLIYKRAM ASQMASANIE RTTVTLRDGT GKHELRATGQ VVKFPGFLAV YEEGRDQKQD
GDEEDGSLLP AMASGDTPAK RGVDATQHFT QPPPRYSEAS LVKRLEELGI GRPSTYAATL
QVLKDRNYVR TEKNRFFAEE SGRLLTAFLE RFFPRYVAYE FTAGMEEELD DVSGGRAEWK
KVLADFWRDF KPKSDEVMEK KPSEVTAELD EFLSDYLFPP RADGTDPRLC PKCEEGRLSL
RGGRYGAFVA CSNYPECKYT RKFAQPGGAD GDGGEDGALG KDPETGLEVV RKAGRFGPYI
QLGDGKEAKR ASIPKDIGEL DLEWALKLLS LPREVGVHPE TGNPITASIG RYGPYLAHDG
KYARLKSTAE VFETGMNAAV MLLAAAAAGA GARGSRAAAE PLKVFGAHPT SGGEIKLMAG
RYGPYVTDGT TNATLPRDKQ PEALTLEEAI TLIDERAAKG PAKGKKKAPA KKASAAKKPP
AAKKAPAKKA APKKAAAAE