Gene Strop_4402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4402 
Symbol 
ID5060888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4979791 
End bp4980936 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content73% 
IMG OID640476665 
Productsigma-70 region 2 domain-containing protein 
Protein accessionYP_001161208 
Protein GI145596911 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.270648 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.805777 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATGGGG AACAGCTGCG GGACCTCGTA CCAGCGGTGA TCGGCATCCT CGTCCGGCGC 
GGTGCGGATT TCGCGTCGGC CGAGGATGCC GTGCAGGACG CCCTGGTCGA GGCGGTGCGC
GGCTGGCCGG ACAGTCCGCC GCAGGACCCC AGAGGCTGGT TGATCACCGT GGCCTGGCGC
AAGTTCCTCG ACGCGGCCCG CGCCGACACC TCCCGGCGCC GGCGCGAGGT ACGCGTCGAG
GGCGAGCCCG CACCGGGGCC GGTCGAGGCG GTGGACGACA CGCTTCAGCT GTACTTCTTG
TGCGCTCACC CCTGTCTGAC ACCGGCCTCG TCCGTCGCGC TCACGCTGCG CGCGGTCGGC
GGCCTGACCA CGCGTCAGAT CGCGCGGGCC TACCTCGTGC CGGAGGCGAC CATGGCCCAG
CGGATCAGCC GGGCCAAGCG TACGGTCTCG GGTGTCCGCC TCAACCAGCC CGGTGATGTC
GCCACGGTGG TGCGCGTGCT CTATCTGGTC TTCAATGAGG GCTACTCCGG GGATGTCGAC
CTTGCCGCCG AAGCGATCCG GCTCACTCGT CAACTCGCTG CCAAGATTAG CCACGAGGAG
GTCGCAGGCC TGCTGGCGCT GATGCTGCTG CACCACGCGC GACGGCCGGC GCGCACCGAC
TCCGACGGCC GGCTCGTGCC TCTTGCCGAG CAGGACCGCA GCCGGTGGAA CCGCCACCTG
ATCGCTGAGG GCGTCGAGCT GCTCCAGAAA GCCCTCGCCC GGGACCGGCT GGGAGAGTTC
CAGGCCCAGG CCGCCATCGC CGCACTGCAC GCCGACGCCC GGACGGTCGA GGAGACCGAC
TGGGTGCAGA TCGTCGAGTG GTACGACGAC CTGGTGCGCC TGACCGACAG CCCGGTGGTT
CGCCTTAACC GGGCGGTCGC CCTCGGGGAG GCCGACGGCC CGAGGGCCGG CCTGGCGGCC
CTGGCCGGGC TCGACCCCGC CCTGCCCCGG CACACCGCCG TCGCGGCCTA CCTGCACGAG
CGGGCGGGCG ACCCGGTGAC CGCGGCCCGG CTCTACGCCG AGGCCGCCCG CTCGGCACCG
AGCCTCCCCG AGCGCGACCA CCTCATCCGA CAGGCCGCCC GACTCAACTC GCCACCGCGT
CGTTGA
 
Protein sequence
MNGEQLRDLV PAVIGILVRR GADFASAEDA VQDALVEAVR GWPDSPPQDP RGWLITVAWR 
KFLDAARADT SRRRREVRVE GEPAPGPVEA VDDTLQLYFL CAHPCLTPAS SVALTLRAVG
GLTTRQIARA YLVPEATMAQ RISRAKRTVS GVRLNQPGDV ATVVRVLYLV FNEGYSGDVD
LAAEAIRLTR QLAAKISHEE VAGLLALMLL HHARRPARTD SDGRLVPLAE QDRSRWNRHL
IAEGVELLQK ALARDRLGEF QAQAAIAALH ADARTVEETD WVQIVEWYDD LVRLTDSPVV
RLNRAVALGE ADGPRAGLAA LAGLDPALPR HTAVAAYLHE RAGDPVTAAR LYAEAARSAP
SLPERDHLIR QAARLNSPPR R