Gene Sare_2375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2375 
Symbol 
ID5705116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2730498 
End bp2731559 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content73% 
IMG OID641271853 
Product5'-3' exonuclease 
Protein accessionYP_001537224 
Protein GI159037971 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0822086 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00237783 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTGTCGA CGCCCACCAC CGACGAGGAC CCGGCGCGGG AGGCCCCATG GCAGACCACA 
CCGGACCCGG AGCCCGGTCC GCCGGCGTGT CACACCATGC TGGCGTCCAC CCGGACGTCG
GTGCGGTGTG CCAGGCTGTC GGGCGTGACA GCCCCGATCA TGCTCGTTGA TGCGCCCAGC
CTCTACTTCC GGGCCTACTT CGGTATCCCC GAGTCCGCCG CCACCGCGCC GGGCGGTCAA
CCGGTCAACG CCGTTCGCGG CTTCCTCGAC ATGCTGGCAA GTCTGATCCG CACCCGGGGG
CCCGGCCGGA TGGTGTGCGC GATGGACCAC GACTGGCGGC CCGACTGGCG GGTGGCCCTG
CTGCCCTCGT ACAAGGCGCA CCGGGTGGCG CCGGAAGGCG GTGAGGTGGT CCCGGACACC
CTGAGCCCAC AGGTGCCGGT GATCCTCGAC GTGCTCGACG CGCTGGGCAT CGCCACTGTT
GGCGCCTCCG GGTACGAGGC CGATGACGTG CTCGGCACCC TCTCGGTCAC CCAACCGGGG
CCCGTCGAGG TGGTCTCCGG TGACCGCGAC CTGTTCCAGC TGGTCGACGA CGCCCGCGGG
GTGCGGTTGC TCTACATCGG GCGGGGGGTG GCCAAGCTGG CGGACTGCGA CGACACCGCG
GTCCGGGCCC GCTACGGTGT GCCAGCGGCC CGCTACGCCG ACTTCGCCGC GCTGCGCGGC
GACCCCAGCG ACGGGCTGCC GGGGGTGCCC GGCGTCGGCG AGAAGACGGC GGCCCGGCTC
GTTGACCGGC ACGGCGACAT CTCCGGTGTG CTCGCCGCCC TGGACGATCC CGGTGCGGGA
TTCGCGCCGG GGCTGCGCGC GAAACTGGCC GCCGCGCGGG ACTACCTGGC CGTCGCCCCG
ACGGTGGTCC GGGTCGCCCT CGATGTGCCC CTTCCGGCCC TGTCCACCGA CCTGCCGACC
GTGCCGGCTG ACCCCGATCG GCTGCTCGAC CTCGCCGAGC GATGGAACGT CGCCGGTGCC
GTCCGGCGCC TGGTCGATGC CCTGGCCGCC CGAACCGATT GA
 
Protein sequence
MVSTPTTDED PAREAPWQTT PDPEPGPPAC HTMLASTRTS VRCARLSGVT APIMLVDAPS 
LYFRAYFGIP ESAATAPGGQ PVNAVRGFLD MLASLIRTRG PGRMVCAMDH DWRPDWRVAL
LPSYKAHRVA PEGGEVVPDT LSPQVPVILD VLDALGIATV GASGYEADDV LGTLSVTQPG
PVEVVSGDRD LFQLVDDARG VRLLYIGRGV AKLADCDDTA VRARYGVPAA RYADFAALRG
DPSDGLPGVP GVGEKTAARL VDRHGDISGV LAALDDPGAG FAPGLRAKLA AARDYLAVAP
TVVRVALDVP LPALSTDLPT VPADPDRLLD LAERWNVAGA VRRLVDALAA RTD