Gene Sare_1497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1497 
Symbol 
ID5705475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1723931 
End bp1725079 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content74% 
IMG OID641271005 
Productnuclease SbcCD, D subunit 
Protein accessionYP_001536386 
Protein GI159037133 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID[TIGR00619] exonuclease SbcD 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.231622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000221272 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAGATCC TGCACACCTC CGACTGGCAC GTCGGCAAGG TCCTCAAGGG ACGCTCACGG 
GCCGAGGAGC ACAAGGCCGT CCTCGCCGGG GTGATCGAGG TGGCCCGGCG GGAACGGCCG
GACCTGGTCG TCGTGGCCGG TGACCTGTAC GACACCGCCG CGCCCACACC CGAGGCGACC
CGGTTGGTCA CCCGCGCGCT GACCGCGCTG CGCCGCACCG GCGCCGACGT CGTGGCGATC
GGCGGCAACC ACGACAACGG CGCGGCGCTG GACGCGCTGC GACCGTGGGC GGAGGCCGCC
GGGATCACCC TGCGGGGCAG CGTGCGCGAG GACCCCGACG AGCACGTCAT CGACGGTACG
ACCGCCGGTG GCGAACGCTG GCGGCTCGCC GCGCTGCCGT TCCTGTCGCA GCGGTACGCG
GTCCGGGCCG TGGAGATGTA CGAGCTGACC GCCGCCGAGG CCAACCAGAC CTACGCCGAC
CACCTGGGGC GGATTCTCAC CCGGTTGACC GAGGGCTTCG CCGAGCCGGA CCGGGTGCAC
CTGGTCACCG CCCACCTGAC CGTGGTCGGG GCGGCGACCG GCGGCGGCGA GCGGGACGCG
CACACCGTCC TGGGCTACGC GGTGCCGGCG ACCGTGTTCC CCGGCACCGC GCACTACGTC
GCGCTCGGTC ACCTGCACCG GGCCCAGCAG GTCCAGGGCG GGTGCCCGAT CCGCTACAGC
GGCAGCCCGC TGGCCGTCGA CTTCGGCGAG CAGGAGAACG TCCCGTCGGT GACCGTGGTC
GAGGTGACCG CGACCACGGC GGCGCAGGTG CGGGAGGTAC CCATCCCCGC CGCCGCCTCG
CTGCGGACCG TCCGGGGCAC GCTCGCGCAG CTCGCCGAGA TCGAGGCACC GGACGCCTGG
CTGCGGGTCT ACGTCCGGGA GCAGCCCCGA GCCGGCCTCC GCGAGGAGGT GCAGGAGCTG
TTCCCCCGCG CCTTGGAGAT CAGGATCGAT CCGGAGCTGG TGCCCGCGTC CGGCAGCGGC
ACCCGCATCG CCCAGCGCGC CGGCCGGTCA CCGCGGGAAC TGTTCGGCGA CTACCTGGAC
GGTCGGGGAC ACACCGACGA CGACGTCCGC GGGCTCTTCG ACGAGCTGTT CGAGGAGGTC
GAGCACTGA
 
Protein sequence
MKILHTSDWH VGKVLKGRSR AEEHKAVLAG VIEVARRERP DLVVVAGDLY DTAAPTPEAT 
RLVTRALTAL RRTGADVVAI GGNHDNGAAL DALRPWAEAA GITLRGSVRE DPDEHVIDGT
TAGGERWRLA ALPFLSQRYA VRAVEMYELT AAEANQTYAD HLGRILTRLT EGFAEPDRVH
LVTAHLTVVG AATGGGERDA HTVLGYAVPA TVFPGTAHYV ALGHLHRAQQ VQGGCPIRYS
GSPLAVDFGE QENVPSVTVV EVTATTAAQV REVPIPAAAS LRTVRGTLAQ LAEIEAPDAW
LRVYVREQPR AGLREEVQEL FPRALEIRID PELVPASGSG TRIAQRAGRS PRELFGDYLD
GRGHTDDDVR GLFDELFEEV EH