Gene Sare_5021 details

Gene Information

Locus tag: Sare_5021
Symbol:
ID: 5705158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5691405
End bp: 5692343
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 68%
IMG OID: 641274414
Product: cell envelope-related transcriptional attenuator
Protein accession: YP_001539755
Protein GI: 159040502
COG category: [K] Transcription
COG ID: [COG1316] Transcriptional regulator
TIGRFAM ID: [TIGR00350] cell envelope-related function transcriptional attenuator common domain
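
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source record.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(5691405, 5692343)
print(length)                     # 939
print(protein_length_aa(length))  # 312
```

Both values agree with the Gene Length and Protein Length fields of the record.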


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00237783
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCTGCTGG CCGGCGTCGC CGTGGTCGGC CTCCGCGAGC TCACCGGGCG GTACGAACAG 
GCAGTGGCCC AGGAGCAACT GCTCGACCCC AGCGCGCGAC AGGACCAGAC AGACCTGGAC
GGGCCATTGA ACTACCTGCT CATCGGCACT GACCGCTGGC GGAGCAGCGG GACCACCGAC
CGACGCTCGG ACGCCATCCT CGTCGTACAC GTGCCGACCG GCGGACGCGA GGCGTTCCTG
ATGTCCGTAC CCCGGGACCT GCTCGTCAGC ATTCCCGCCG CCCCCGGGTA CGGCGGTGGC
CAGGACAAGA TCAACTCCGC ATTCACACAC GGTGGTGGGG GGCAGCCGGG TGCCCGGCTA
CTCTCCGCCA CCCTGGCCCA CCTGGCCGGA ATCCGCTTCG ACGGTGCCGC CGTCATCGAC
TTCTCCGGCT TCCGCGAGGT CATCGACCTC CTCGGCGGCG TCCGAATATG CGTCGACACC
GAGTTCCGTT CGATCCACAC CGACCGGGTC TTCGCCCCCG GATGCCAGGA GATGGACGGT
CCCACCGCGC TGGACTTCGT TCGCCAGCGT CACAACCTGC CCAACGGCGA CTACGACCGA
CAGACCCACC AGCAGCAACT GCTCCGAGCA GTACTGCAAC GCACCTCCGA GACCTCGCTG
CGGAGCGACC CGATCCGCCT GGACCGGGTG ATCCGTGCGG TCGGCAGCTC ACTGACCGTG
GACACCAACG GAGTACCCCT GGCGGACCTG GTGATGGCGT TCCGGGACCT GCCGACGGAC
GCCCTACGCG GAATCCAGAT GCCGTCGCAC TCGCAAACCA TCGGCCAGGT CTCGTACGTC
GTACTCGACG ACGGCGGTTC CGATCTCCTC ACGGCGGTTC GGGACCGAAG AGTGCCGCAA
TGGGCCCGGG CCAACCCCCG GTGGGTCACC CAGCTCTAA
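
A GC content figure such as the one in the record can be recomputed from a sequence laid out in 10-base groups like the one above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases; the function name is hypothetical, and the rounding rule used by the database is not stated in the record.

```python
def gc_content_percent(dna):
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 10-base group of the gene sequence above: 7 of 10 bases are G or C.
print(gc_content_percent("GTGCTGCTGG"))  # 70.0
```

Passing the full gene sequence (all lines joined) gives the value to compare against the GC content field.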
 
Protein sequence
MLLAGVAVVG LRELTGRYEQ AVAQEQLLDP SARQDQTDLD GPLNYLLIGT DRWRSSGTTD 
RRSDAILVVH VPTGGREAFL MSVPRDLLVS IPAAPGYGGG QDKINSAFTH GGGGQPGARL
LSATLAHLAG IRFDGAAVID FSGFREVIDL LGGVRICVDT EFRSIHTDRV FAPGCQEMDG
PTALDFVRQR HNLPNGDYDR QTHQQQLLRA VLQRTSETSL RSDPIRLDRV IRAVGSSLTV
DTNGVPLADL VMAFRDLPTD ALRGIQMPSH SQTIGQVSYV VLDDGGSDLL TAVRDRRVPQ
WARANPRWVT QL
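
The gene sequence starts with GTG while the protein sequence starts with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is an allowed initiation codon and is read as methionine at the first position. The sketch below illustrates this with a hand-built codon table; the helper name `translate_cds` is hypothetical, and the code assumes the input is a complete coding sequence in frame.

```python
# Codon table in NCBI order (bases T, C, A, G at each codon position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Initiation codons permitted by translation table 11; each is read as Met
# when it is the first codon of the CDS.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons initiate with Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":       # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGCTGCTGG CCGGCGTCGC CGTGGTCGGC CTCCGCGAGC TCACCGGGCG GTACGAACAG"
print(translate_cds(first_line))  # MLLAGVAVVGLRELTGRYEQ
```

The output matches the first 20 residues of the protein sequence in the record.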