Gene Sare_3489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3489 
Symbol 
ID5704760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4024455 
End bp4025471 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content65% 
IMG OID641272916 
ProductLacI family transcription regulator 
Protein accessionYP_001538282 
Protein GI159039029 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.148473 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0487492 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCATC ACCCGGCCAA GCCCCCCCAG CCCCGACGGC CCACGATGGT GGATGTTGCG 
CGGCATGCCG GTGTCAGCCT CAAGACCGTC TCCCGCGTGG TCAACAACGA GCCGGCAGTG
GGCCCTGAAC TTGCCGGGCG AGTCATGACC GCCATCTCGG AGTTGGGGTT CCGGCGCAAC
GGCATTGCCC GCAACCTTCG ATCCCAGCAG GTCAGTTCGA CGATCGGCCT GCTCATCGAG
GAGATCGCCA ACCCCTTCTA CGCGGAGATC GCGGGTGTCG CGGCGGAGAT CGCGGCAGCA
CACCAGACAC TACTGATCAC CGCATCCTCG GAAGAGGATC CGGAGCGCGA ACGCACGCTG
TTGCTGGAGA TGGCGCAGCG CCGGGTGGAC GGCCTGCTCG TGGTTCCGGC CGGCGTGGAC
CATTCCTTCC TTCGACGGGA GGTGGAGATG GGTCTGCCGG CGGTGTTTCT CGACCGGCCG
CCGGGGCAGC TACTTGCCGA CGTGGTCCTC CTGGACAACC AGGGCGGAAG TCGCGCCGGA
ATCCGCTCAC TGCTCGACAC GGGGCACCGG CGGATCGGAA TTTTGTTGGG TTCGTTGAGC
GTGTACACAA TGCGTGAGCG ATTGGCCGGT GTACAGGCCG AGTTGTCAGG TGCCGGAATC
CCGTACGACG AGGCTTTGGT GCGGAGCGGA ATAGAGGGTC CCGAACAGGC CAGCCAGGCG
GTGGCAGACA TGTTCGATCT GGCCGACCAG CCCACGGCTT TCTTTTGCGC CAACAATCGC
CTCACGCTCG GCGCGCTCCA GGAACTCCAC CGCCGAGGAA ACGATGCGGC GTTGGTCGGT
TTTGACGACT TCGAACTGTC CCATCTGATG CCCAGACCGT TCACCGTCAT CGCCTACGAC
CAGCGTGAAC TGGCACGGGT CGCGACGGAG CAGCTGTTCG GGCGCATCGG CGGAAATCGG
TCGTGGCCGT CGACCACGGT CCTGCCGACG CATCTGGTGC AGCGCGGCCT GGACTGA
 
Protein sequence
MLHHPAKPPQ PRRPTMVDVA RHAGVSLKTV SRVVNNEPAV GPELAGRVMT AISELGFRRN 
GIARNLRSQQ VSSTIGLLIE EIANPFYAEI AGVAAEIAAA HQTLLITASS EEDPERERTL
LLEMAQRRVD GLLVVPAGVD HSFLRREVEM GLPAVFLDRP PGQLLADVVL LDNQGGSRAG
IRSLLDTGHR RIGILLGSLS VYTMRERLAG VQAELSGAGI PYDEALVRSG IEGPEQASQA
VADMFDLADQ PTAFFCANNR LTLGALQELH RRGNDAALVG FDDFELSHLM PRPFTVIAYD
QRELARVATE QLFGRIGGNR SWPSTTVLPT HLVQRGLD