Gene Strop_3646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3646 
Symbol 
ID5060121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4173922 
End bp4175949 
Gene Length2028 bp 
Protein Length675 aa 
Translation table11 
GC content71% 
IMG OID640475901 
Producttranscription termination factor Rho 
Protein accessionYP_001160455 
Protein GI145596158 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTGC CGGAGCTTCA GAGCCTGGCC GCGTCGCTCG GCATCTCGGG CACGGCTCGC 
ATGCGCAAGG GTGAGCTGAT CAGCGCGATC ACCGAGCGCC AGGGTGGCGG GGCAGCCACC
GGAACCCCTC GACCGCGGGC CGAGGTTGCG GCTGCCGCGG CCCCCGCCCG GGGGGAGGTC
CACGCGGAGG TCCGGGAGTC GGGCGAGCGG GCGGAGACCG AGTCGCGTCC CGCCGAGCAG
CCGACGACCA CCCCGACCAC CGGCCGGGCC CGTGGCCGGC GTAGCCGTGC GGTCAGTGAG
GCCGGTGAGG CCCGCGCTGA GACGCGACCG GAGGAGGCCG AGGCCGGCGA ACGCCGGGAG
CGTGGTGAAG GCCGCGGTGG CCGGGACCGT GCTGAGCGCG GCGAGCGTGC TGAGCGTGGC
GAGCGTGCTG AGCGGGGCGA GCGTGCTGAG CGTGGCGAGC GTGCTGAGCG GGGCGAGCGT
GCTGAGCGGG GCGAGCGTGC TGAGCGGGGC GAGCGTGCTG AGCGGGGCGA GCGTGCTGAG
CGTGGCGAGC GTGCTGAGCG GGGCGAGCGT GCTGAGCGGG GCGAGCGTGC TGAGCGGGGC
GAGCGTGCTG AGCGTGGCGA GCGCGGTGAG CGTGGCGAGC GTGCTGAGCG GGGCGAGCGT
GCTGAGCGGG GCGAGCGCGG TGACCGTGGC GAGCGTGCTG AGCGTGGCGA GCGCGGTGAC
CGTGGCGAGC GTGCTGAGCG GGGCGACCGG AACGACCGTG GCCAGCGTGA CAACGACGGC
GAGGAGGAGA ACGAGGGCGG CGGTCGGCGT GGCCGGCGCA GCCGCTTCCG GGATCGTCGG
CGCGGCCGTG GCGACCGGGA CGGCGACGGT GGGCGGGAGC CCCAGGTCAG CGAGGACGAT
GTCCTCGTCC CGGTAGCGGG CATCATCGAT GTGCTCGACA ACTACGCCTT CGTCCGGACC
ACCGGCTATC TGGCCGGGCC GAATGACGTT TACGTCTCTA TGTCCCAGAT TAAGCGGTAC
GGTCTGCGGC GTGGTGACGC GATCACCGGT GCCGTCCGCG CGGCGCGGGA GGGCGAGCAG
CGGCGGGACA AGTACAACCC GCTGGTCCGG CTGGACACCA TCAACGGGAT GGAGCCGGAG
GAAGCGAAGC GCCGGCCGGA GTTCTATCGA CTCACCCCGC TCTACCCGCA GGAGCGGCTG
CGGCTGGAGA GCGAGCCGCA CATCCTCACC ACTCGGGTGA TCGACCTGGT GATGCCGATC
GGCAAGGGCC AGCGGGCGCT TATCGTTTCG CCGCCCAAGG CCGGCAAGAC GATGGTGTTG
CAGGCGATCG CGAACGCGAT CACCCACAAC AACCCGGAGT GCCACCTGAT GGTGGTGTTG
GTGGACGAGC GTCCAGAAGA GGTCACCGAC ATGCAGCGGT CGGTGAAGGG CGAGGTCGTC
GCGGCAACCT TCGACCGGCC GCCGCAGGAC CACACCACCG TCGCCGAGTT GGCGATTGAG
CGGGCGAAGC GCCTGGTCGA GCTGGGCCAC GACGTGGTCG TGCTGCTGGA CTCGGTGACG
CGGCTTGGTC GGTCGTACAA CCTGGCGGCG CCGGCCAGCG GCCGGATCAT GTCGGGTGGT
ATCGACTCCA CCGCGTTGTA CCCGCCGAAG CGGTTCCTGG GTGCGGCCCG CAACATCGAA
AACGGTGGTT CGCTGACCAT CCTCGCCACC GCGCTGGTGG AGACCGGTTC GACGGCGGAC
ACGGTCATTT TCGAGGAGTT CAAGGGCACC GGTAACGCGG AGCTGAAGCT GGATCGGAAG
ATCGCCGACA AGCGGACCTT CCCGGCCATC GACATCCACC CGTCCGGTAC GCGTAAGGAG
GAGATCCTGC TCGCGCCGGA GGAGCTGGCC ATCGTCCACA AGCTCCGGAA GGTGCTGCAC
GCGCTGGACT CACAGGCCGC GCTGGACCTG CTGCTGGACC GGCTCAAGAA GTCGCGGACC
AACATCGAGT TCCTGATGCA GATTGCGAAG TCGACGCCAG GGGAGTGA
 
Protein sequence
MLLPELQSLA ASLGISGTAR MRKGELISAI TERQGGGAAT GTPRPRAEVA AAAAPARGEV 
HAEVRESGER AETESRPAEQ PTTTPTTGRA RGRRSRAVSE AGEARAETRP EEAEAGERRE
RGEGRGGRDR AERGERAERG ERAERGERAE RGERAERGER AERGERAERG ERAERGERAE
RGERAERGER AERGERAERG ERAERGERGE RGERAERGER AERGERGDRG ERAERGERGD
RGERAERGDR NDRGQRDNDG EEENEGGGRR GRRSRFRDRR RGRGDRDGDG GREPQVSEDD
VLVPVAGIID VLDNYAFVRT TGYLAGPNDV YVSMSQIKRY GLRRGDAITG AVRAAREGEQ
RRDKYNPLVR LDTINGMEPE EAKRRPEFYR LTPLYPQERL RLESEPHILT TRVIDLVMPI
GKGQRALIVS PPKAGKTMVL QAIANAITHN NPECHLMVVL VDERPEEVTD MQRSVKGEVV
AATFDRPPQD HTTVAELAIE RAKRLVELGH DVVVLLDSVT RLGRSYNLAA PASGRIMSGG
IDSTALYPPK RFLGAARNIE NGGSLTILAT ALVETGSTAD TVIFEEFKGT GNAELKLDRK
IADKRTFPAI DIHPSGTRKE EILLAPEELA IVHKLRKVLH ALDSQAALDL LLDRLKKSRT
NIEFLMQIAK STPGE