Gene Saro_2086 details

Gene Information

Locus tag: Saro_2086
Symbol:
ID: 3917734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2223045
End bp: 2225045
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 66%
IMG OID: 640444839
Product: DNA topoisomerase IV subunit B
Protein accession: YP_497359
Protein GI: 87200102
COG category: [L] Replication, recombination and repair
COG ID: [COG0187] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit
TIGRFAM ID: [TIGR01055] DNA topoisomerase IV, B subunit, proteobacterial
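
The coordinate and length fields above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2223045, 2225045)
print(length_bp)                  # 2001
print(protein_length(length_bp))  # 666
```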


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0442115
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGACG ACCTTTTCGA CAAGCTCCCC GCCACCAGCG CCGCCGAAGC CTATGACGGT 
TCCGCGATCG AGGTTCTCGA AGGCCTCGAA CCGGTCCGCC GCCGCCCCGG CATGTACATC
GGCGGCACGG ACGAACGCGC GCTGCACCAC CTCGCCGCAG AAGTGCTCGA TAACGCGATG
GACGAAGCGG TCGCTGGCCA CGCCAACCGC ATCGAGGTCC TGCTCGAGGA AGGCAACCGG
CTGACCATCT CCGACAACGG TCGCGGCATC CCGGTCGACG AACATCCCAA GTATCCGGGC
AAGTCGGCAC TCGAAGTGAT CCTCACCACG CTCCACTCGG GTGGCAAGTT CTCGGGCAAG
GCCTATGCAA CCTCGGGCGG CCTCCACGGC GTCGGCGTCT CGGTCGTCAA CGCGCTGTCC
TCGCTCACCC GCGTCGAAGT AGCCCGCAAC AAGGAACTCT ACGCGCAGGA ATTTTCGCGC
GGGAACCCGA CGACGAAGCT GATGAAGGTC GGCAACGCCC CCAACCGGCG CGGCACCCAG
GTCACCTTCA TCCCCGACAC CGAGATCTTC GGAGAGGACG CGAAGTTCAA GCCTGCGCGC
CTGTTCCGGC TGGTCCGCTC CAAGGCCTAC CTCTTCGCCG GCGTCGAGAT CCGCTGGAAG
TGCGAGCCCT CGCTCGCCAG CGACGACGTG CCCGCCGAAG CCGTGTTCCA GTTCCCCGGC
GGCCTGGCCG ATCATCTGGC CGAACAGGTC TCGGGCCGCG AATGCGTCAC TTCGGTCCCC
TTCGCGGGTC GCCAGGATTT CCCGCTCGGT CCCGATGGCG AGGAAATGGG CCGCGTCGAA
TGGGCGATTG CCTGGCCGCT GTGGTCCGAC GGCTCCTACT CGTGGTACTG CAATACCATT
CCCACGCCCG ATGGCGGCAC CCATGAACAG GGCCTGCGCG CGGCACTGAC CAAGGGCATC
CGCGCCTTCG CGGACCTCAT CGGCCAGAAA AAGGCCAAGG ACATCGCGCC GGAAGACATG
ATCACCGGCA GCGAGATCAT GCTCTCGGTC TTCATCCGCG ATCCCCAGTT CCAGAGCCAG
ACCAAGGACC GCCTGACCAG CCCCGAAGCC GCGCGCATGG TCGAGGCCGC CGTGCGCGAC
CATTTCGACC ACTTCCTCAC CGACAACATG GAGCGCGGCA AGGCCCTGCT CGGCGCGGTG
ATGGAGCGCA TGGACGAACG CCTGCGCCGC AAGGCGGAGC GTGAGGTCAA GCGCAAGACC
GCGACCAACG CCCGCAAGCT GCGCCTCCCC GGCAAGCTCA CCGACTGCTC GGGCGAAGGC
AGCGGAGAGA CCGAACTGTT CATCGTCGAA GGCGATTCGG CAGGCGGCAG CGCCAAGCAG
GCACGCGACC GCAAGACCCA GGCGATCCTG CCGATCCGCG GCAAGATCCT CAACGTCGCA
AGCGCCACCG CCGACAAGAT CCGCGCCAAC CAGGAAATCG CCGACCTCGC GCTCGCACTC
GGCTGCGGCA TGCGCAAGGA CTGCAATCCC GATGCTCTAC GCTATGACCG CGTCATCATC
ATGACCGACG CCGACGTCGA CGGCGCACAC ATCGCCACGC TGCTGATGAC GTTCTTCTTC
CAGGAGATGC CCGAGCTCGT CAGGCGCGGT CATCTCTACC TCGCCCAGCC GCCGCTCTAC
CGCCTGACCA GCGGCAGCAC GTCGGCCTAC GCCAAGGACG ACGCCCACCG CGCCGAACTC
GAAGCCACGA AGTTCAAGGG CAAGAAGGTC GAAGTCGGGC GCTTCAAGGG CCTCGGCGAA
ATGAACCCTC AGCAGTTGCG CGAAACAACC ATGGCGCCTG CCACCCGCAG CCTCATCCGC
ATCACCCTGC CGCCCGAGTA CGAAGGCCGC GCGGCAGTGA AGGACCTGGT CGACAGGCTG
ATGGGCCGCG ACCCGGCACA ACGCTTCATG TTCATCCAGA ACCGCGCGAG CGAGATCGAT
CCGGAACTGA TCGACGCGTG A
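
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any computation. As a minimal sketch, a GC content figure like the one reported under Gene Information could be recomputed as follows; `clean_sequence` and `gc_content` are hypothetical helper names, and how the source database rounds its value is not stated.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on the first printed row only; paste the full listing to check the gene.
first_row = "ATGTCCGACG ACCTTTTCGA CAAGCTCCCC GCCACCAGCG CCGCCGAAGC CTATGACGGT"
print(round(gc_content(clean_sequence(first_row)), 1))
```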
 
Protein sequence
MSDDLFDKLP ATSAAEAYDG SAIEVLEGLE PVRRRPGMYI GGTDERALHH LAAEVLDNAM 
DEAVAGHANR IEVLLEEGNR LTISDNGRGI PVDEHPKYPG KSALEVILTT LHSGGKFSGK
AYATSGGLHG VGVSVVNALS SLTRVEVARN KELYAQEFSR GNPTTKLMKV GNAPNRRGTQ
VTFIPDTEIF GEDAKFKPAR LFRLVRSKAY LFAGVEIRWK CEPSLASDDV PAEAVFQFPG
GLADHLAEQV SGRECVTSVP FAGRQDFPLG PDGEEMGRVE WAIAWPLWSD GSYSWYCNTI
PTPDGGTHEQ GLRAALTKGI RAFADLIGQK KAKDIAPEDM ITGSEIMLSV FIRDPQFQSQ
TKDRLTSPEA ARMVEAAVRD HFDHFLTDNM ERGKALLGAV MERMDERLRR KAEREVKRKT
ATNARKLRLP GKLTDCSGEG SGETELFIVE GDSAGGSAKQ ARDRKTQAIL PIRGKILNVA
SATADKIRAN QEIADLALAL GCGMRKDCNP DALRYDRVII MTDADVDGAH IATLLMTFFF
QEMPELVRRG HLYLAQPPLY RLTSGSTSAY AKDDAHRAEL EATKFKGKKV EVGRFKGLGE
MNPQQLRETT MAPATRSLIR ITLPPEYEGR AAVKDLVDRL MGRDPAQRFM FIQNRASEID
PELIDA
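
The protein sequence corresponds to the gene sequence read in frame with translation table 11. The sketch below illustrates this on the first and last printed rows of the gene sequence; each full row holds 60 bases, so every row starts in frame. The codon table is built from the standard amino acid string for codons ordered T, C, A, G, and the `translate` helper is an illustrative name. It assumes the first codon is translated by the table (true here, since the gene starts with ATG) and does not handle alternative start codons.

```python
from itertools import product

# Amino acids for translation table 11, with codons ordered by bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGTCCGACG ACCTTTTCGA CAAGCTCCCC GCCACCAGCG CCGCCGAAGC CTATGACGGT"
last_row = "CCGGAACTGA TCGACGCGTG A"
print(translate(first_row))  # MSDDLFDKLPATSAAEAYDG
print(translate(last_row))   # PELIDA
```

Both results match the start and end of the protein sequence listed above.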