Gene Sare_3462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3462 
Symbol 
ID5708064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3993051 
End bp3994310 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content72% 
IMG OID641272889 
ProductDNA-directed DNA polymerase 
Protein accessionYP_001538255 
Protein GI159039002 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00370364 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCCGCA GCCAGTCGTT GCCGCGGGGC GACGACCCGC GCTTCGGACC GGACGCCGAT 
GACTCCGGAT GCCGGGTGCT GCACGTCGAC ATGGACGCGT TCTTCGCCTC CGTCGAGGTA
CGCCACCGGC CGGAGCTGCG TGGCCGGCCG GTGGTGGTCG GTGGGCTTGG GCCGCGTGGA
GTGGTCAGCT CTGCGAGCTA CCCGGCCCGG CGATACGGCG TCCGCAGCGC GATGCCGACC
GCGCGGGCCC GGGCGCTCTG CCCACACGCG GTGTTCCTGC CGCCCGATTT CACCGCCTAC
ACGACCGCCT CCCGCACCGT GATGCAGATC TTCCAAGACG TCACCCCGCT GGTCGAGCCG
CTCTCCCTGG ACGAGGCGTT CCTTGACGTG GCCGGCGCCC AGCGGCTGTT CGGCCAACCA
TCGGCGATCG CCCGGCTGAT CCGCGAGCGG GTGGCGACCG AGGTGGGGCT CACCTGCTCG
GTTGGGGTGG CCTCGAGCAA GTTCGTCGCG AAGCTCGGCT CGACCCGGGC CAAACCGGAC
GGGATGCTGG TGGTCCCGAC CGCGCGGGTA CTTGACTTCC TGCATCCGCT GCCGGTGGAG
GCGCTGTGGG GGGTCGGCGA GCGGTCGGCC GAGACACTAC GCCGGCTCGG CCTGACCACT
GTCGGTGAGC TGGCGCAGGC GCCCGACGGG ATGCTCCGTC GGGCGCTCGG TACCGCTGCG
GCCCGTCACC TCCGCGAGCT GGCATGGGGC AAGGACCCGC GGCGGGTCAC CTCGGAACGG
GAGGACAAGT CGATCGGCGC GGAGGTGACG TTCGACGCCG ACGTGACCGA TCGACGGGAG
ATCCGACGTG CCCTGCTCGG GCTCGCCGAG AAGGTCGGTG CTCGGCTGCG CCGGTCCGGC
CAGGTGGGGC GGACGGTGGC GTTGAAGGTT CGGCTGGCCG ACTTTCGTAC CGTTAGCCGT
TCCCGCAGCC TCGACGTCCC GACCGATGTC GGTCGGGAGA TGTTCGACAC AGCCTGGGCG
CTTTACACCG CTCTCGACCC GGGGGAGCCG ATCCGGTTGG TCGGTGTCCG GGCCGAAGGA
CTCGCGACCG CCCGGAACGC CCCCCGGCAG CTCGCGCTCG GCGAGCCTGA GCGAGGATGG
CGGGAGGCCG AACGTGCCGT TGACGCCGCC GCCGCCCGTT TCGGGCGGTC CGTCATCGGC
CCAGCCAGCC TGCTTCGTGC CCGTGACCAG CACCTGCGGG AAAATCCGCG TCGGCCGTAG
 
Protein sequence
MGRSQSLPRG DDPRFGPDAD DSGCRVLHVD MDAFFASVEV RHRPELRGRP VVVGGLGPRG 
VVSSASYPAR RYGVRSAMPT ARARALCPHA VFLPPDFTAY TTASRTVMQI FQDVTPLVEP
LSLDEAFLDV AGAQRLFGQP SAIARLIRER VATEVGLTCS VGVASSKFVA KLGSTRAKPD
GMLVVPTARV LDFLHPLPVE ALWGVGERSA ETLRRLGLTT VGELAQAPDG MLRRALGTAA
ARHLRELAWG KDPRRVTSER EDKSIGAEVT FDADVTDRRE IRRALLGLAE KVGARLRRSG
QVGRTVALKV RLADFRTVSR SRSLDVPTDV GREMFDTAWA LYTALDPGEP IRLVGVRAEG
LATARNAPRQ LALGEPERGW REAERAVDAA AARFGRSVIG PASLLRARDQ HLRENPRRP