Gene Sare_1698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1698 
Symbol 
ID5704009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1960415 
End bp1961494 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content72% 
IMG OID641271201 
Producthypothetical protein 
Protein accessionYP_001536576 
Protein GI159037323 
COG category[S] Function unknown 
COG ID[COG4850] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000770913 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGTCGGGG TCGGGCGGTT GTGGCAGGGT GGCCGTGTGC CACCGACTGC CGCCGGCCAA 
CTGGCCGTAC CCCAGCTGCA CCGGGCCGCG CGGATCGAGG ACGCCGTGCA CCACCTGGTC
GAGCGGCGGC TGCGGCGGAC CGGCTGGCGG ATCCACACCG TGGCCTACCC GGGCTACGGC
GCCCCTGGCT GGATCCGGGT GATGTGTCGG GTGCTGTTGG GGCGGCCGGA CAACCGGCAG
CGGGGGCGGC CGGAGAAGGT TCGGGGCTGG CGCAGCTTCG CCACCCTGCC GGCCAAGTAC
GTCACGGTGG CCATCGAGTC GGGGGACGTA CGGCACGAGA CGCGGACCGA CCGCAGCGGC
TTCGTGGACA CGATCGTGCC GGTCGACCTG CCTCCCGGGT GGGGGTCGGT GTGGATAAGC
GTCCCGGAGG CCGAGCCGGT CCAGGCGCCG GTACGGATCC TGGACCCGCA GGTACGGTTC
GGGGTCATCT CCGACGTCGA CGACACGGTC ATGGTCACCA CGCTTCCGCG GCCACTTCTC
GCCGCCTGGA ACACGTTCGT GCTGGACGAG CATGCCCGAG CCGCGGTGCC CGGGATGGCC
GTGCTGTACG AGCGGCTGGT CACGGCCCAC CCCGGCGCCC CGGTGTTCTA CCTGTCCACC
GGCGCCTGGA ACGTGGCGCC GACACTCACC CGGTTCCTGT CTCGGCACCT CTACCCGGCT
GGGCCGCTGC TGCTCACCGA CTGGGGTCCG ACGGCAGACC GGTGGTTCCG CAGTGGTCGG
GAGCACAAGC GAGCCACCCT GACCCGACTG GCCACGGAGT TCCCCGAGGT GAAGTGGCTG
TTGGTGGGCG ACGACGGCCA GCACGACCAG GAGATCTACC GGGAGTTCGC CGTGGCCCAC
CCGGACAACG TCGCGGGGGT GGCGATTCGC CGGCTCTCAC CGACCCAGGC GGTGCTCGCC
GGTGCTCCGC CCAACCCGGT CAGCGACAGC GCGTCGGTTC CTCCGGTGGG GCAGAAATGG
CTCTCCGCGC CCGACGGCGC CGGGCTGTGG CAGCTGCTGC GGGAGGCGGG TCTGGTCTGA
 
Protein sequence
MVGVGRLWQG GRVPPTAAGQ LAVPQLHRAA RIEDAVHHLV ERRLRRTGWR IHTVAYPGYG 
APGWIRVMCR VLLGRPDNRQ RGRPEKVRGW RSFATLPAKY VTVAIESGDV RHETRTDRSG
FVDTIVPVDL PPGWGSVWIS VPEAEPVQAP VRILDPQVRF GVISDVDDTV MVTTLPRPLL
AAWNTFVLDE HARAAVPGMA VLYERLVTAH PGAPVFYLST GAWNVAPTLT RFLSRHLYPA
GPLLLTDWGP TADRWFRSGR EHKRATLTRL ATEFPEVKWL LVGDDGQHDQ EIYREFAVAH
PDNVAGVAIR RLSPTQAVLA GAPPNPVSDS ASVPPVGQKW LSAPDGAGLW QLLREAGLV