Gene Saro_2274 details


Gene Information

Locus tag: Saro_2274
Symbol:
ID: 3916590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2413476
End bp: 2415197
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 71%
IMG OID: 640445028
Product: single-stranded-DNA-specific exonuclease RecJ
Protein accession: YP_497545
Protein GI: 87200288
COG category: [L] Replication, recombination and repair
COG ID: [COG0608] Single-stranded DNA-specific exonuclease
TIGRFAM ID: [TIGR00644] single-stranded-DNA-specific exonuclease RecJ
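
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2413476, 2415197)
print(length, protein_length_aa(length))
```

Under these assumptions the coordinates reproduce the reported 1722 bp and 573 aa.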


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.349175
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCTCG GCGGCGCGGG TGGCATCCAG GGTGGCGGGC TTTCCGACGA CATCGTGACG 
CAGCTCCTGC TGTCGCGCGG AGTGGGCCGC GAGGACCTCG ACCGCCATCG CCGCCCGACG
TTGCGCGACT TCCTGCCGGA CCCGTCGATC TTCCAGGACA TGGACAACGC CGCACGCCGG
CTGACCGATG CGATCCTGTC GGGCGAGCGC ATCACGATCT ACGGCGACTA CGACGTCGAT
GGGGCGACCA GCGCGGCGCT GCTGATCCGG CTGCTGCGCG AACTTGGCGT CGATGCGGGG
CACTATATCC CCGACCGCCT GCTGGAAGGC TATGGGCCCT CGGGCGAAGC CCTGGTCCGC
CTGGCGCGCG AGGGGTCGAA TCTCATCGTC ACGGTCGACT GCGGCGCGAT GGCGCACGAA
GCGCTTGCGG CGGCGGCAGC CGAAGGCGTG GACGTAATCG TCGTCGACCA CCACAAGTGC
GCCCCCGAGC TTCCCCGCGC GGCGGCGCTG GTCAATCCCA ACCGCCTTGA CGAATGCGAC
GAAGCTGCCG CCCACGGGCA TCTGGCAGCC GTGGGCGTCG CCTTCCTGCT GGCGGTGGCG
ACCGTGCGCG AACTGCGCGG ACGCGGCTTT TTCGACGGGC GCAAGGCACC GGACCTGATG
GCACTGCTGG ACCTCGTGGC GCTGGGCACG GTGGCCGACG TCGCGCAGCT CAAGGGTCTC
AACCGCGCGC TCGTCTCGCA GGGGCTCAAG ATCATGGCGC GGCGCGAGAA CGTCGGCCTT
TCCGCGCTGA TCGACGCCAG CCGGCTGAGC CGCGCGCCCA CGTGCAGCGA TCTCGGCTTC
GCGCTCGGGC CGCGGATCAA TGCGGGCGGG CGCGTTGGCG AGGCGACGCT GGGCGTACGC
CTGCTGACGA CCGAGGACCC GGAGGAAGCG CGGGCGATCT CGGCCCAGCT ATCGCGGCTC
AACGACGAGC GCCGCGCCAT CGAGCAGGCC GTGCAGGAGG CCGCCGAGGC GCAGGTCGAC
GCGCAGCACA ACCGGGCCGT GATGGTCCTG GCGGGCAGCG GCTGGCATCC GGGCGTGATC
GGCATCGTGG CCGGACGGAT CAAGGAGAAG ACCGGCAAGC CGACCCTCGT GATCGCGCTC
GACGCGGACG AGGCCGGGCA TGGCAAGGGG TCGGGCCGCT CGATCTCGGG GGTGGACCTT
GGCGCGGCGA TCATCGCCGC GCGCGAGGCC GGATTGCTGG TGGCGGGCGG CGGACACGCC
ATGGCCTGCG GATTGACCAT CGAGCCTTCA GCGCTGTCGC GACTTGCGGA TTGGCTGGAC
GAAAGGCTTT CACGCGACGT GACGGCGGCG CAATCGTCGC AGGCGCTGCT GCTGGACCTT
TCGCTAAGTC CGGGCGGGCT GACGCCGGAT CTGGTCGAGA CGCTCGAGGC GGCAGGCCCG
TTCGGCGTGG GCTGGCCCGG GCCGCGCGTG GCAGTGGGCC CGGTGCGGCT GGTCAAGTGC
GACCTCGTCG GCACCGACCA CGTGAGGATG GTCGCCGCAG GTGCCGACGG CCGGTCGTTC
AAGGCCATCG CCTTTCGCGC GGCGCAGAGC GAACTTGGAC AGGTTCTGCT CAACGCATCG
CGCGGCCGAC AGTTCTGGCT GGCGGGTCGC GCGCGGATCG ATGACTGGGC CAGCCGCCCG
GCGGCCGAAC TCCATGTCGA GGATGCCGCA TTCGCCGACT GA
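
The gene sequence is printed in blocks of 10 bases, 60 per line, so the whitespace has to be stripped before any analysis. A minimal sketch of that clean-up, followed by a GC-fraction calculation, is shown here. The helper names are hypothetical, and the example runs on the first line of the sequence only, so its result is not expected to equal the whole-gene GC content reported in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGGACCTCG GCGGCGCGGG TGGCATCCAG GGTGGCGGGC TTTCCGACGA CATCGTGACG"
seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))
```

Passing the full 1722-base block through the same two functions would give the value to compare against the GC content field.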
 
Protein sequence
MDLGGAGGIQ GGGLSDDIVT QLLLSRGVGR EDLDRHRRPT LRDFLPDPSI FQDMDNAARR 
LTDAILSGER ITIYGDYDVD GATSAALLIR LLRELGVDAG HYIPDRLLEG YGPSGEALVR
LAREGSNLIV TVDCGAMAHE ALAAAAAEGV DVIVVDHHKC APELPRAAAL VNPNRLDECD
EAAAHGHLAA VGVAFLLAVA TVRELRGRGF FDGRKAPDLM ALLDLVALGT VADVAQLKGL
NRALVSQGLK IMARRENVGL SALIDASRLS RAPTCSDLGF ALGPRINAGG RVGEATLGVR
LLTTEDPEEA RAISAQLSRL NDERRAIEQA VQEAAEAQVD AQHNRAVMVL AGSGWHPGVI
GIVAGRIKEK TGKPTLVIAL DADEAGHGKG SGRSISGVDL GAAIIAAREA GLLVAGGGHA
MACGLTIEPS ALSRLADWLD ERLSRDVTAA QSSQALLLDL SLSPGGLTPD LVETLEAAGP
FGVGWPGPRV AVGPVRLVKC DLVGTDHVRM VAAGADGRSF KAIAFRAAQS ELGQVLLNAS
RGRQFWLAGR ARIDDWASRP AAELHVEDAA FAD
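
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows one way to reproduce it without external libraries. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in which start codons are permitted, which does not matter here because the gene starts with ATG). The table construction and the `translate` helper are illustrative, not the database's own method.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGGACCTCG GCGGCGCGGG TGGCATCCAG"))
```

The first ten codons give MDLGGAGGIQ, matching the start of the protein sequence; the final four codons of the gene (TTC GCC GAC TGA) give FAD followed by a stop, matching its end.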