Gene Saro_2780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2780 
Symbol 
ID3916940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2999521 
End bp3001764 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content65% 
IMG OID640445559 
Productexcinuclease ABC subunit B 
Protein accessionYP_498050 
Protein GI87200793 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.768947 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTGGA TCACTTCGGT TGCGCGGACG CTGTTTGTTC CCCATATCCG GCGCATGGCC 
GAACTCGTTA TCCGCAGGGG ACTCGAAGAG CCCGACACCT CCGGCACCTT CGTGCCCCAC
CGCCCCGCAA GGCCGGACAA GGTGGAAGGC GGCAAGCGCT TCAGGATCGT GTCCGACTAC
CAGCCGGCGG GCGACCAGCC GACCGCCATC GCCGATCTCG TCGAAGGCAT CCGCGCGGAT
GACAAGACGC AGGTCCTGCT TGGCGTCACC GGTTCGGGCA AGACCTTCAC GATGGCGCAG
GTCATCGAGG CGACCCAGCG GCCCGCGCTG ATCCTTGCCC CCAACAAGAT CCTCGCCGCC
CAGCTCTATG GCGAGATGAA GAGCTTCTTC CCCGAAAACG CGGTCGAATA TTTCGTCTCC
TACTACGACT ACTACCAGCC GGAGGCCTAC GTGCCCCGGT CGGACACCTA CATCGAGAAG
GAAAGCTCGG TGAACGAGGC GATCGACCGG ATGCGCCACT CGGCCACCCG CGCCCTGCTG
GAGCGCGACG ACGTGATCAT CGTCGCCTCG GTCTCGTGCC TCTATGGCAT CGGCTCGGTC
GAAACCTACT CGGCCATGAT CTTCGACCTC AAGGTCGGCA CCACGGTCGA CAGCGGCGAG
ATCATCCGCA AGCTGGTGGC CCTGCAGTAC AAGCGCAACG ATGCCGCCTT CAGTCGCGGC
AACTTCCGCG TACGCGGCGA CAATCTCGAG ATCTTCCCCT CGCACTACGA AGACGTTGCC
TGGCGCATCT CGTTCTTCGG CGACGAGATC GAGCAGATCG TCGAGTTCGA TCCGCTGACC
GGCAAGGCGG GCACGAAGCT CACCGCGATC CGCGTCTACG CCAATTCGCA CTACGTGACG
CCCGGCCCGA CGATGAAGCA GGCCGCAGAC GCGATCCGCT TCGAACTGAC CGAGCGGCTC
AAGGAACTGG TCGCGGAAGG AAAGCTGCTC GAAGCGCAAC GGCTGGAACA GCGCACCAAC
TTCGACCTGG AAATGATCGC CGCGACCGGT TCATGCGCCG GGATCGAGAA CTACAGCCGT
TTCCTGACCG GCCGCCTCCC CGGCGAACCG CCCCCCACGC TGTTCGAATA CCTGCCGGAC
AATGCCCTGC TCTTCGTCGA CGAGAGCCAC CAGACGGTGC CGCAGATCGG CGCGATGGCG
CGAGGCGACC ATCGCCGCAA GCTTACGCTC GCCGAATACG GCTTCCGCCT GCCGAGCTGC
ATCGACAACC GACCGCTGCG CTTCAACGAA TGGGACGCGA TGCGCCCCCA GACGGTCGCG
GTCTCGGCCA CCCCGGGCGG CTGGGAAATG GAGCAGGCCG GCGGCGTCTT TGCCGAACAG
GTCATCCGCC CGACCGGCCT GATCGACCCG CCGGTGCTGA TCCGCCCGGT CGAGGACCAG
GTGCAGGACT GCATCAACGA GTGCCGCGAG ACCGCCGCCA AGGGCTATCG CACGCTCGTC
ACCACCCTGA CCAAGCGCAT GGCGGAAGAC CTGACCGAGT TCATGCACGA AGCGGGCCTG
CGCGTACGCT ACATGCACTC CGACGTCGAG ACGCTGGAGC GCATCGAGCT GATCCGCGAC
CTGCGGCTTG GCGTCTATGA CGTTCTCGTC GGCATCAACC TGCTGCGCGA AGGTCTCGAC
ATTCCCGAGT GCGGCCTCGT CTGCATCCTC GATGCCGACA AGGAGGGCTT CCTGCGCTCC
GAGACCTCGC TGATCCAGAC CATCGGCCGC GCCGCGCGCA ACGTCGATGG CCGCGTCATC
CTCTATGCCG ATCGCATGAC CGGCTCGATG GAACGCGCCA TCGCCGAAAC CGACCGCCGC
CGCGCAAAGC AGCAGGCCTA CAACGAAGAA CACGGCATCA CGCCGCAAAC GATCAAGCGC
AACATCCACG ACATCGTCGC GGATACCGCC AGCCGCGACG GCGTGGTCGT CGACACCGGC
GACGACGAGC GCAACAACCT CGTCGGCCAC AACCTGCGCG CCTATATCGA GGACCTCGAA
AAGCGCATGC GCGCGGCCGC AGCGGACCTC GAATTCGAGG AAGCCGGCCG CCTGCGCGAC
GAGATCAGGC GGCTCGAGGC CACCGAACTC GGCCTGCCTG AAGGCGAGCG GAAAGCGCCG
ATCGTGGGAC GCAGCAACGA AGGCAAGCCG GGTACGCGCA AGACGCGCTA CGGGAAGTCA
CAGAAGACGA AGTGGGGGAA GTAG
 
Protein sequence
MAWITSVART LFVPHIRRMA ELVIRRGLEE PDTSGTFVPH RPARPDKVEG GKRFRIVSDY 
QPAGDQPTAI ADLVEGIRAD DKTQVLLGVT GSGKTFTMAQ VIEATQRPAL ILAPNKILAA
QLYGEMKSFF PENAVEYFVS YYDYYQPEAY VPRSDTYIEK ESSVNEAIDR MRHSATRALL
ERDDVIIVAS VSCLYGIGSV ETYSAMIFDL KVGTTVDSGE IIRKLVALQY KRNDAAFSRG
NFRVRGDNLE IFPSHYEDVA WRISFFGDEI EQIVEFDPLT GKAGTKLTAI RVYANSHYVT
PGPTMKQAAD AIRFELTERL KELVAEGKLL EAQRLEQRTN FDLEMIAATG SCAGIENYSR
FLTGRLPGEP PPTLFEYLPD NALLFVDESH QTVPQIGAMA RGDHRRKLTL AEYGFRLPSC
IDNRPLRFNE WDAMRPQTVA VSATPGGWEM EQAGGVFAEQ VIRPTGLIDP PVLIRPVEDQ
VQDCINECRE TAAKGYRTLV TTLTKRMAED LTEFMHEAGL RVRYMHSDVE TLERIELIRD
LRLGVYDVLV GINLLREGLD IPECGLVCIL DADKEGFLRS ETSLIQTIGR AARNVDGRVI
LYADRMTGSM ERAIAETDRR RAKQQAYNEE HGITPQTIKR NIHDIVADTA SRDGVVVDTG
DDERNNLVGH NLRAYIEDLE KRMRAAAADL EFEEAGRLRD EIRRLEATEL GLPEGERKAP
IVGRSNEGKP GTRKTRYGKS QKTKWGK