Gene Sare_3599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3599 
Symbol 
ID5707589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4153738 
End bp4154961 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content69% 
IMG OID641273023 
ProductXRE family transcriptional regulator 
Protein accessionYP_001538388 
Protein GI159039135 
COG category[K] Transcription 
COG ID[COG1396] Predicted transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.34153 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000152397 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACGAGC TACCCATCGG GCGACGGGTG GCCTACTGGC GGGGGCGACG CAAGATGTCC 
CAGCAGGTCT TCGCGGACCG GCTGGGCAAG TCGAAAAGCT GGGTGGACAA GGTCGAGCGT
GGGGTGCGCC GACTGGACAA GTTCTCCGTA ATCTACGAGA TCGCCGACAC CCTCCGGGTC
GACGTGCAAC TACTGCTCGG CAAGGACCCA GAACGGCGAT CCGATGCGCT CAACTGCATC
GACCCGAGCG AGGTGCAGGA AATCAGGGCG GCCCTGGAGC GGTACGACGC GATGAGTGCC
TACTTCGACG CGGCACCGTG CCCACCACCC TTGTCCGACA TGCGCAAGGC CGTCACCCAC
GCCTGGTTGA CCTATCAGTA TGGCCGGTAC GGGATGCTGA CCCGATCCTT GCCCAAGCTC
CTGCGGGACG CCCAGGCGGC CGACGCCGGC TTCGGCGGCG ACGACAACCG CGAAGCGGCA
CACCTGCTCG GGCAGGTCTA CCAGATCGCC TCGTCGGTCC TGCGCAAGCT GGGGGAGTGT
GACCTGGCAT GGCTGGCCGC CGACCGGTCG ATGGCGGTCG CCCAGCGGGC CGACGACCCA
CTGCTGGCGG GGATCGCCAC CACCCGGGTC GGTAACGCGT TGGTGGCGAT GGGGCGCCCC
CGGCCGGCGC TGGAGGTGAA CGTCGCGATC GCGAACCGCC TGGCCCCTGG CAGTCACGAC
GAGGCCATCC CGGACCGGCT GTCGGTCTAC GGCATGCTCC TCCTCCAAGG AGCGATGGCC
GCATCCCGGA TCGGCGACGC CGCCACAGTG GACGACCTAC TGAGCGGCGC GGACGAGGCC
GCCGCCCTGC TTGGCGGTGA CCAGAACCAT TACTGGACGT CCTTCGGACC CACCAACGTG
AAGCTTCACC GCGCCGCAGC GGCGGTGGAG CTTGGCGACG GCGGCCGAGC GGTGGAGGTC
CACCAACGGA TCGGTCCCGA ATTCAACGCG CTGCTGCCCG AACGTCGCGC GCACCACCTG
CTCGACATCG CCCGGGGCTA TGCCCAGGTC GGTGACGTGG CAAACGCCGG CGAGATGCTG
CTGCGGGGCG ACCGGCTCGC CCCGTCGGAG ATCCGTTGCC GGCCCATCGC CCACGAGGTG
ATGTCCGACA TCCTGCGTCG CACTCGTGGC GCCCCACCCG CCTCGATAGC GGAGTTGGCT
GAACACATGG GAGTTGGGGT ATGA
 
Protein sequence
MDELPIGRRV AYWRGRRKMS QQVFADRLGK SKSWVDKVER GVRRLDKFSV IYEIADTLRV 
DVQLLLGKDP ERRSDALNCI DPSEVQEIRA ALERYDAMSA YFDAAPCPPP LSDMRKAVTH
AWLTYQYGRY GMLTRSLPKL LRDAQAADAG FGGDDNREAA HLLGQVYQIA SSVLRKLGEC
DLAWLAADRS MAVAQRADDP LLAGIATTRV GNALVAMGRP RPALEVNVAI ANRLAPGSHD
EAIPDRLSVY GMLLLQGAMA ASRIGDAATV DDLLSGADEA AALLGGDQNH YWTSFGPTNV
KLHRAAAAVE LGDGGRAVEV HQRIGPEFNA LLPERRAHHL LDIARGYAQV GDVANAGEML
LRGDRLAPSE IRCRPIAHEV MSDILRRTRG APPASIAELA EHMGVGV