Gene Sare_4641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4641 
Symbol 
ID5706228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5262068 
End bp5263075 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content70% 
IMG OID641274042 
ProductADP-ribosylation/crystallin J1 
Protein accessionYP_001539389 
Protein GI159040136 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1397] ADP-ribosylglycohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGAAC TGAGTGTGTC GACTGCGGCA TCAGGGTGTT TGTTCGGCCT GGCATACGGC 
GACGCGCTGG GCAAACCGAC CGAGTTCATG ACGGTCGCCG ACATCGTTGC CCAGTACGGA
CCCGGTGGCC CTCGTGAGTT GGCGGGCGAT CCCGCTCTGG TCACCGACGA CACCCAGCTG
ACCCTGGCGG TCGGGACGGC GTTGATGAAC GCGCCCACGC TGTCGGCCGA GGTGCTGGAG
CCCCTGCTTC GCGAGCGGTT CGTGCGCTGG GCCGGCAGCC CCGACAACGA CCGGGCGCCC
GGTATGACCT GCCTGCGAGC TTGCGGTGAC CTCGCGCTCG GCCGGCCGTG GACACAGGCC
ACAGTGATCG GCTCCAAAGG CTGCGGCGCG AACATGCGGG TCGCGCCGGT CGGGCTGGTC
GCCGGCGACG ACCTGGACAC CCTTGCCGGG GTGGCGCAGC TGCAGGCGGC GATGACACAT
GGGCATCCCA CCGCCCTCGC GGCCAGTGAG TTGACCGCAT ACGCCGTGCG GCTGTTGTGC
GACGGGACGG AGCCGGCCGT CCTGCCCGCC CTGCTCCGGG CCCGATGCCA CGACCAGCGC
ACTGTCTACC GCGCCGAGTG GCTGGACGTG CTGTGGCAAC AGCCCGGGGT CGCCAGCCCC
GCCGACTACA TCAGCCGGGG CTGGGACGAG TGCCTACGGG TACTGGATCG GCTCGATCTC
GCACTCGCCC CGGCCGACGA CCGTGACGAC GCCTGTCGGG TCACCGGTGC CGGTTGGGTC
GCCGAGGAGG CCCTGGCCAC GGGACTGCTG TGTGCGATCC GGCATACCGA CGATCCGGTG
TCCGCTCTTG CCCGCGCCGC TACGACCTCA GGTGACTCCG ACTCCATCGC CTGCCTGACC
GGCGCGTTCC TCGGCGCCGC GTTCGGTATG GCCGCCTGGC CGGTGTCCTG GCGTGATCGG
ATCGAGTACG CCGACGAGCT CACCACGATG GGTGAAGCCT GGAACTGA
 
Protein sequence
MRELSVSTAA SGCLFGLAYG DALGKPTEFM TVADIVAQYG PGGPRELAGD PALVTDDTQL 
TLAVGTALMN APTLSAEVLE PLLRERFVRW AGSPDNDRAP GMTCLRACGD LALGRPWTQA
TVIGSKGCGA NMRVAPVGLV AGDDLDTLAG VAQLQAAMTH GHPTALAASE LTAYAVRLLC
DGTEPAVLPA LLRARCHDQR TVYRAEWLDV LWQQPGVASP ADYISRGWDE CLRVLDRLDL
ALAPADDRDD ACRVTGAGWV AEEALATGLL CAIRHTDDPV SALARAATTS GDSDSIACLT
GAFLGAAFGM AAWPVSWRDR IEYADELTTM GEAWN