Gene Saro_2793 details

Gene Information

Locus tag: Saro_2793
Symbol:
ID: 3916953
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3015091
End bp: 3017376
Gene Length: 2286 bp
Protein Length: 761 aa
Translation table: 11
GC content: 65%
IMG OID: 640445572
Product: DNA topoisomerase IV subunit A
Protein accession: YP_498063
Protein GI: 87200806
COG category: [L] Replication, recombination and repair
COG ID: [COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit
TIGRFAM ID: [TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial
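
The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Codon count minus one, assuming the CDS ends with a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3015091, 3017376)
print(length)                     # 2286
print(protein_length_aa(length))  # 761
```

Both results match the Gene Length and Protein Length fields of the record.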


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGACCG ATACGACAGA ACCCGATCCG TTCGACGCAA TCGTCGATGC TCCGTTCGAT 
TCCGCGCTGT CGGAACGCTA CCTCGTCTAT GCGCTGTCGA CGATCACCGC GCGCTCGCTG
CCCGACCTGC GCGACGGTCT GAAGCCGGTG CACCGCCGCC TGCTGTGGGC GATGCGGCAA
CTGAAGCTGG ATCCGGCGCA GGCGTTCAAG AAATCGGCCC GCGTGGTCGG CGACGTCATC
GGCAAGTATC ACCCCCATGG CGATGCCAGC GTCTATGACG CGATGGTCCG CCTCGCGCAG
GACTGGGCGC TGCGCTATCC GCTGGTCGAG GGGCAGGGCA ACTTCGGCAA CATCGACGGC
GATAACGCCG CGGCCTATCG CTATACCGAA GCCAGGCTGA CGAAGACGGC GATCCAGCTC
ATGTCCGGGC TGGACGAGGG CACCGTCGAC TTCGTGCCCA CCTACAACGG CGAGGAGGAA
GAGCCGGAAA TCTTTCCGGG CCTGTTCCCG AACCTGCTGG CCAATGGATC GAGCGGCATC
GCGGTGGGCA TGGCCACCAA CATCCCGAGC CACAACGTGG CCGAGATCAT CGATGCGACG
CTGCTGCTGA TCGACAATCC GCATGCCGAG CACGCCCAGT TGATGGAAGT GTTCCACGGG
CCCGACCTGC CGACGGGCGG CGTCATCGTC GACAGTCCGG CGGTGATTTC CAACGCCTAT
GAAACCGGAC GCGGCGCGAT CCGGGTGCGC GGGCGCTTTT CGACGGGCCG CGACGAGGCT
GGCAACTGGG AAGAGAGCGG CATCGAGAAG CTGGGCGGCG GGCAGTGGCA GCTCGTGGTC
TCGGAAATCC CCTACATGGT GCAGAAGGGC AAGCTGATCG AGCAGATTGC CCAGCTCATC
GCCGACAAGA AGCTGCCGAT CCTCGAGGAC ATCCGCGACG AGAGCGACGA GCAGATCCGG
CTGGTGCTGA TCCCCAAGAG CCGAAATGTC GACCCCGACC TGCTCAAGGA ATCGCTCTAC
CGACTGACCG ACCTTGAAAC GCGGTTCGGG CTCAACCTCA ACGTGCTGGA TTCGCGGCGG
ACGCCCGGCG TGCTGGGGCT GAAGCTGGTA TTGCAGGAGT GGGTGATCTC GCAGATCGAC
ATCCTGCTGC GGCGTTCGCG CCATCGGCTG GACAAGATCG CCTCGCGGCT CGAGCTGCTC
GAAGGCTATA TCATTGCCTA TCTCAACCTC GACCGGATCA TCGAGATCAT CCGCACCGAG
GACGAGCCCA AGCCGGTGAT GATGGCCGAG TTCGAGCTGA CCGACCGCCA GGCCGAGGCG
ATCCTCAACA TGCGGCTGCG ATCCCTGCGC AAGCTCGAGG AAATGGAGCT GCGCAAGGAA
CGCGATGCCC TGCTGGCAGA GCAAGAGGAG CTGCAGAAGC TGCTCGACAG TCCGGCGCGC
CAGCGCACCC GGCTGAAGCG CGACCTGGCG GCGCTGCGCA AGGACTATGC CGAGGACACC
GCGCTGGGAC GGCGGCGCAC GACGATTGCC GAAGCCGCGC CGACGCGCGA GTTCAGCATG
GACGCGATGA TCGAGAAGGA GCCGGTGACG GTGATCCTTT CGGCCAAGGG CTGGATCAGG
GCGGCCAAGG GGCATGTGCC GCTCGATGGC GATTTCAAGT TCAAGGAAGG CGATGGCCCG
GCCTTCGCAC TTCACTGCCA GACCACGGAC AAGCTGCTGG TGGCGGTGGA CAACGGGCGG
TTCTACACGC TGGGGGCCGA CAAGCTGCCG GGCGCGCGGG GCTTTGGCGA GCCTATCAGG
ACGATGGTGG ACATCGATCC GGATGCGCAG ATCGTTTCGG TCCTGCCCTA CAAGCCCAAG
GGGCAACTGC TGCTCGCGGC GAACACCGGG CGCGGCTTTG CCGCCGAGAT GGACGAACTG
CTGGCCGAGA CGCGAAAGGG GCGCGGGGTG GTTTCAACCA AGCCGGGCGT CAAGCTTGCC
GTGGTGCGCG AGATCGCGCC CGAGCACGAT CACGTCGCGG TGATCGGCGA GAACCGCAAG
CTGGTGATCT TCGCGCTTTC GGAAGTGCCG CTGCTCGCCA AGGGGCAGGG CGTCACGCTG
CAGCGCTACA AGGACGGCGA GATGTCGGAC GTGATCACGC TGCGGCTCGA AGATGGGCTT
ACCTGGGCGA TGGGCGGCGA GAGCGGACGG ACGCGGGTGG AGAAGGATCT GCTGCCGTGG
AAGGTTGCGC GCGGCGCCGC GGGGCGCCTG CCGCCGAACG GGTTTCCGCG AGATAACCGG
TTCTGA
 
Protein sequence
MATDTTEPDP FDAIVDAPFD SALSERYLVY ALSTITARSL PDLRDGLKPV HRRLLWAMRQ 
LKLDPAQAFK KSARVVGDVI GKYHPHGDAS VYDAMVRLAQ DWALRYPLVE GQGNFGNIDG
DNAAAYRYTE ARLTKTAIQL MSGLDEGTVD FVPTYNGEEE EPEIFPGLFP NLLANGSSGI
AVGMATNIPS HNVAEIIDAT LLLIDNPHAE HAQLMEVFHG PDLPTGGVIV DSPAVISNAY
ETGRGAIRVR GRFSTGRDEA GNWEESGIEK LGGGQWQLVV SEIPYMVQKG KLIEQIAQLI
ADKKLPILED IRDESDEQIR LVLIPKSRNV DPDLLKESLY RLTDLETRFG LNLNVLDSRR
TPGVLGLKLV LQEWVISQID ILLRRSRHRL DKIASRLELL EGYIIAYLNL DRIIEIIRTE
DEPKPVMMAE FELTDRQAEA ILNMRLRSLR KLEEMELRKE RDALLAEQEE LQKLLDSPAR
QRTRLKRDLA ALRKDYAEDT ALGRRRTTIA EAAPTREFSM DAMIEKEPVT VILSAKGWIR
AAKGHVPLDG DFKFKEGDGP AFALHCQTTD KLLVAVDNGR FYTLGADKLP GARGFGEPIR
TMVDIDPDAQ IVSVLPYKPK GQLLLAANTG RGFAAEMDEL LAETRKGRGV VSTKPGVKLA
VVREIAPEHD HVAVIGENRK LVIFALSEVP LLAKGQGVTL QRYKDGEMSD VITLRLEDGL
TWAMGGESGR TRVEKDLLPW KVARGAAGRL PPNGFPRDNR F
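
The sequences above are laid out in space-separated groups of 10 characters, 60 per line, so they need cleaning before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the example uses only the first line of the gene sequence, so its GC value is not expected to equal the 65% reported for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGGCGACCG ATACGACAGA ACCCGATCCG TTCGACGCAA TCGTCGATGC TCCGTTCGAT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC content of this line only
```

Applying `clean_sequence` to the full gene sequence block should yield a string whose length equals the Gene Length field.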