Gene Sare_4908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4908 
Symbol 
ID5707424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5575820 
End bp5577028 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content67% 
IMG OID641274303 
ProductXRE family transcriptional regulator 
Protein accessionYP_001539648 
Protein GI159040395 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00382252 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTCGAGC AGCAGCCAAC CCCCGGCCAA CGCGTCGAAC GGCTCCGTCG GGCAGCCGGC 
CTATCCCGCG AACGCCTCGC GGGACTCGCG GGACTCAGCG CGACCACCGT GAAGTTCATT
GAGAACGGCC GGCGATCATT GACCTTGAGG GCGGCGCAGC AACTCGCCCC ACACCTCGGC
GTGCGTGATC TCGGCGATCT ATTCGGCCCT CAGGTTCCCT TGTCTTTGGA TGCCCGACCC
AGTCACCCCG CCGTTGACGA CGTTCGCAGA GCCCTCACTG CCTGGCAGGT CACCATTAGT
GGTGAGCCCG AATCCACTGA CTACCTTCGT GGTGCGGTCG ACTCCGCCTG GCAGACGTGG
CATACCAGCC GCCACCAACG CACCGAGGCC GGTCACCTAC TACCCGGGCT GATAGAGGCA
ACTCAACGCG CCACCCGGCT GCACCACGGG GAGGAACGAC GCGCCTCACT GGCGCTGCTC
GCCCAGGCGT ACCACCTTGC CCAGGCGTTC CTAGCCTGGC ACGGTGACCG TGAGTTGTGC
TGGCTCGCCG TGGACCGGGG CATGACCGCC GCCCTGGACG CCGACGACCC ACTAGCCATC
GCACAGTCGA TCTGGTATGC CGCTCACATA CTCCGCGCTG CAGGACGAGG AAGTGATGCC
CTGGAGCGGC TGGGCGAGGC GCGATCGCTG ATCGAGCCAC ATGTGACTGA CGGTGGCGTC
GAGTGGGCCG AGATGCTCGC CGACCTGCAC CTGTGTATCG CATTGACGAA GGCGCGGATG
GGAGATCACG GAGCTTGGTC TGATTGGGAC ACCGCCCGCA CCGTCGTCGA CCGGGCGCTA
CCCGCCGGGT TTGTCGGCCT ACGCACCCGG GTATCCCGCC CGTTGGTCGA CGTGTACGCG
GTGATGTGCG CTGTGGACCT GGGTGACCCG GACGAGGCAC GGCGTCGCGC CCACGCCCTG
GACCCGGCCT CTATCCCGTC GACCGAACGT CGCGGACGCC ACTATGTGGA GCTGGCGCGA
TCGGCTGACC TGGAAGGGGC ACGCGAGGCG ACCCTACATT TGCTGACCAG GGCTGAGGCC
ACCAGCCCGG AAACCGTGCG GTACTCGCCG GCAGCACAGG ACATGCTGGC ACGGCTCGCG
CGTGAGGCCC CAGCGTCAGT GCGGGCGGAA GCAGTGGACC TAGCCCACCG ATTGGGAGTA
GCAACCTAG
 
Protein sequence
MVEQQPTPGQ RVERLRRAAG LSRERLAGLA GLSATTVKFI ENGRRSLTLR AAQQLAPHLG 
VRDLGDLFGP QVPLSLDARP SHPAVDDVRR ALTAWQVTIS GEPESTDYLR GAVDSAWQTW
HTSRHQRTEA GHLLPGLIEA TQRATRLHHG EERRASLALL AQAYHLAQAF LAWHGDRELC
WLAVDRGMTA ALDADDPLAI AQSIWYAAHI LRAAGRGSDA LERLGEARSL IEPHVTDGGV
EWAEMLADLH LCIALTKARM GDHGAWSDWD TARTVVDRAL PAGFVGLRTR VSRPLVDVYA
VMCAVDLGDP DEARRRAHAL DPASIPSTER RGRHYVELAR SADLEGAREA TLHLLTRAEA
TSPETVRYSP AAQDMLARLA REAPASVRAE AVDLAHRLGV AT