Gene Sare_0420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0420 
Symbol 
ID5708397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp480619 
End bp481551 
Gene Length933 bp 
Protein Length310 aa 
Translation table11 
GC content69% 
IMG OID641269945 
ProductECF subfamily RNA polymerase sigma-24 factor 
Protein accessionYP_001535340 
Protein GI159036087 
COG category[K] Transcription 
COG ID[COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02952] RNA polymerase sigma-70 factor, TIGR02952 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.512317 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00742101 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGACCT TCGGCTACAT GGAACGGCCG GTCGGCCTGA GCGACATACC TTCCCGGTCC 
GCGGTGAACG AACGGTCGAC TCCCGGACCC CGGGAGCAGG GCCAACTCTC CACGTGGGGC
GAAAGCGGGG TACGCAGCCG TCCACATCAC AATGAGGCGC CCCCCCGACC CTCGATTCCG
GGCGGAAACG CCAAGCCGGT CGGAACCCGG GTCGCGTCGC CGGCCCGACC GACGATGCCC
GTGCAGGGCC GCCGGGCGAC CGACCCACCC GCCACCGCCG ACCCCGCCAC CACCGATACG
GCGGTACTGC CCGCACTGCC GGCCAGCACG CCCGCCACCG GCTTCCCGAG CCGCCCCGAC
CCGTCCGACC CGGCGACCGA GATCTGGACA TTGGTCGAAC GGGCGCAGGC CGGGGAGGCC
GAGGCGTTCG GCCTGATCTA CGACCGGTAC GTGGACACCG TCTTCCGGTT CGTCTACTTC
CGGGTGGGTA ACCGCCAACT GGCCGAGGAC CTCACCTCCG ACACCTTCCT GCGGGCATTG
AAGCGAATCG GTAGCTTCAC CTGGCAGGGC CGAGACCTCG GGGCCTGGCT GGTGACGATC
GCCCGCAACC TGGTGGCGGA CCACTTCAAA TCCGGCCGCT ACCGGCTCGA GGTGACCACT
GGCGACGTAC TCGACGCCGA ACGCGAGGAC CGCGGCCCGG AAGGCAGCCC GGAGGCCGCC
GTAGTCGAAC ACATCACCAA TGTGACCCTG CTCAGCGCCG TCAAGCAGCT CAACCCGGAG
CAGCAGGAGT GCATCGTGCT CCGCTTCCTC CAGGGCTTCT CGGTGGCGGA GACCGCCCGG
GCAATGGGCA AGAACGAGGG TGCGATCAAG GCGTTGCAGT ACCGGGCGGT TCGGGCCCTC
GCCCGGCTAC TCCCGGACGG CTTCCGGATG TAG
 
Protein sequence
MTTFGYMERP VGLSDIPSRS AVNERSTPGP REQGQLSTWG ESGVRSRPHH NEAPPRPSIP 
GGNAKPVGTR VASPARPTMP VQGRRATDPP ATADPATTDT AVLPALPAST PATGFPSRPD
PSDPATEIWT LVERAQAGEA EAFGLIYDRY VDTVFRFVYF RVGNRQLAED LTSDTFLRAL
KRIGSFTWQG RDLGAWLVTI ARNLVADHFK SGRYRLEVTT GDVLDAERED RGPEGSPEAA
VVEHITNVTL LSAVKQLNPE QQECIVLRFL QGFSVAETAR AMGKNEGAIK ALQYRAVRAL
ARLLPDGFRM