Gene Sare_4126 details

Gene Information

Locus tag: Sare_4126
Symbol:
ID: 5708106
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4685353
End bp: 4686309
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 70%
IMG OID: 641273554
Product: NUDIX hydrolase
Protein accession: YP_001538907
Protein GI: 159039654
COG category: [L] Replication, recombination and repair
COG ID: [COG2816] NTP pyrophosphohydrolases containing a Zn-finger, probably nucleic-acid-binding
TIGRFAM ID:
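
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the record above.
START_BP = 4685353
END_BP = 4686309
GENE_LENGTH_BP = 957
PROTEIN_LENGTH_AA = 318


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 957
    print(expected_protein_length(GENE_LENGTH_BP))  # 318
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.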


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGACG GCGGATCGGG CCCGCCGTTG GCCCGTTCCA CCCTGGACCG GGCGGCGCAC 
CGGCGTGCCG ATACCGACTG GCTGGGGCTG GCCTGGGAAC GGTGCAGGGT GCTCGTGCTG
GACAGCGGCA ACGGGGGGCG GGCGTTGGTG TCCGTCGAAC CGGAGCCATC GGAGCCACCG
CTGCTGGTGC TGGTCGGCCC GGAGGACGTA CCGGACACGG CGGCTGCGCG AGCGATGTTC
CTCGGCGTCG AGTCAGATGG CGTGCCGGTG TTCGTGGTGG ATGCGCCACT GCCGGTGCTG
TCCGATACCC GCGCGGTGCA CCTGTTGGAG GTCGGCCATC TGCTCGCCGA CCGGGACGCG
GGTCTGTTCA CCACGGGTCT GGCCCTGCTC AACTGGCACC GTGGGCACCC GTACTCTCCG
CGTACCGGGC AGGCGACCGC GATAGACGAG GCCGGTTGGT CCCGGGTTGC GCCAGCAGGG
GAGCGGATGT GGCCGCGGAC CGACCCGGCG ATGATCGTTC TGGTGCACGA TGGTGGGCCC
GGGCGTGCTG GGCGCTGTCT GCTGGGTAAC AACGCCTCCT GGCCACGCGT ACCGGGTGAG
CTGCGCTTCT CCTGCCTCGC CGGCTATGTG GAGCCGGGGG AGTCGGCCGA GGCAGCGGTG
GTGCGTGAGG TGCGCGAGGA GGTCGACGTA CCGGTCACGA ACGTCACGTA CGTGGGTAGC
CAGGCATGGC CGTTCCCGAG TTTGTTGATG TTGGGATTTC AGGCACTGGC CGATTCGCGG
TCTCCGGTGC GGGTCGACCC GGCGGAGATC GCATCAGCCC GTTGGTTCAC CCGGGCCGAG
ATCGGGGCGG CGCTGGCCGG GCGGATTGTT GATGTGGATG GTGAGCGGCT GGTCCTGCCA
CCACCGTCGT CGATCGCGTC CTTCCTGCTG CGTCACTGGC TCGACGGCCA TTGCTGA
 
Protein sequence
MTDGGSGPPL ARSTLDRAAH RRADTDWLGL AWERCRVLVL DSGNGGRALV SVEPEPSEPP 
LLVLVGPEDV PDTAAARAMF LGVESDGVPV FVVDAPLPVL SDTRAVHLLE VGHLLADRDA
GLFTTGLALL NWHRGHPYSP RTGQATAIDE AGWSRVAPAG ERMWPRTDPA MIVLVHDGGP
GRAGRCLLGN NASWPRVPGE LRFSCLAGYV EPGESAEAAV VREVREEVDV PVTNVTYVGS
QAWPFPSLLM LGFQALADSR SPVRVDPAEI ASARWFTRAE IGAALAGRIV DVDGERLVLP
PPSSIASFLL RHWLDGHC
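
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method. It assumes the sequences are pasted as plain text in blocks of ten separated by whitespace, as shown above. The codon table is the standard one, which translation table 11 shares for amino-acid assignments. Treating ATG, GTG, and TTG as start codons that read as M at the first position is a simplification of table 11's full initiator set. The names gc_content and translate are hypothetical helpers.

```python
from itertools import product

# Standard codon table in NCBI "TCAG" order; table 11 uses the same
# amino-acid assignments and differs only in its permitted start codons.
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product("TCAG", repeat=3), _AMINO)
}
# Simplified assumption: only these start codons are treated as initiators.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS; a start codon at position 1 reads as M,
    and translation stops at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("GTGACGGACG GCGGATCGGG CCCGCCGTTG"))  # MTDGGSGPPL
```

The first ten residues produced from the first line of the gene sequence match the start of the protein sequence in the record, including the GTG start codon read as M.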