Gene Strop_3895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3895 
Symbol 
ID5060373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4457180 
End bp4458244 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content66% 
IMG OID640476152 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_001160703 
Protein GI145596406 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCGTCA TATGGCGGGC GCCCCGGAAG AAAAGAGAAG AGATGCTCAT CTCCCAGCGA 
CCCTCGCTCT CCGAGGAGTC GATCAACGAG ACCCGGTCCC GGTTCACCAT CGAGCCGCTG
GAGCCCGGCT TCGGCTACAC CCTGGGTAAC TCGCTGCGCC GGACGCTGCT GTCGTCCATT
CCCGGCGCGG CGGTGACCTC GATCAAGATC GATGGTGTGC TGCACGAGTT CACCACGATC
CCCGGGGTCA AGGAGGATGT GGTCGAGCTC GTCATGAACA TCAAGGAGCT CTGCGTCAGC
TCCGAGCACG ACGAGCCGGT CAGCATGTAC CTGCGCAAGC AGGGCCCGGG TGACGTGACC
GCGGGTGACA TCCAGCCCCC GGCTGGCGTC TCGGTGCACA ACCCGGACCT GAAGCTCGCC
ACCCTCAACG GCAAGGGCCG GCTCGACATG GAGCTGACCG TCGAGCGGGG CCGTGGCTAC
GTCACCGCGG CGCAGAACAA GCAGGCGGGT GCCGAGATCG GTCGGATCCC GGTCGACTCG
ATCTACTCGC CGGTGCTCCG GGTGACCTAC CGGGTCGAGG CGACCCGGGT CGAGCAGCGG
ACCGACTTCG ATCGGCTGAT CATTGACGTC GAGTCCAAGC CGTCGATGGG GCCCCGTACG
GCCCTGGCCT CGGCCGGTTC CACGCTGGTC GAGCTCTTTG GCCTGGCCCG GGAGCTGGAC
GAGACCGCAG AGGGCATCGA CATCGGGCCG TCCCCGCAGG ACGCCCAGCT GGCAGCGGAT
CTGGCGCTGC CGATCGAGGA GCTGGACCTC ACCGTCCGCT CCTACAACTG CCTCAAGCGC
GAGGGCATCA ACACCGTTGG TGAGCTCATT GGGCGTACCG AGGCTGACCT CCTCGACATC
CGTAACTTCG GCCAGAAGTC GATCGACGAG GTCAAGATGA AGCTCGCTGG GATGGGCTTG
GGGCTGAAGG ACTCGGCCCC GAACTTCGAC CCGGCGAACG TCGTGGACGC CTTCGGCGAG
GCCGACTACG ACACCGAGGA CTACCGCGAG ACTGAGCAGC TGTAA
 
Protein sequence
MGVIWRAPRK KREEMLISQR PSLSEESINE TRSRFTIEPL EPGFGYTLGN SLRRTLLSSI 
PGAAVTSIKI DGVLHEFTTI PGVKEDVVEL VMNIKELCVS SEHDEPVSMY LRKQGPGDVT
AGDIQPPAGV SVHNPDLKLA TLNGKGRLDM ELTVERGRGY VTAAQNKQAG AEIGRIPVDS
IYSPVLRVTY RVEATRVEQR TDFDRLIIDV ESKPSMGPRT ALASAGSTLV ELFGLARELD
ETAEGIDIGP SPQDAQLAAD LALPIEELDL TVRSYNCLKR EGINTVGELI GRTEADLLDI
RNFGQKSIDE VKMKLAGMGL GLKDSAPNFD PANVVDAFGE ADYDTEDYRE TEQL