Gene Sros_1651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1651 
Symbol 
ID8664928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1765618 
End bp1767732 
Gene Length2115 bp 
Protein Length704 aa 
Translation table11 
GC content72% 
IMG OID 
Producttranscription termination factor Rho 
Protein accessionYP_003337385 
Protein GI271963189 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.271851 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.770035 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGACA CCACCGAACT CCTCTCCGAC GCCGCCGGCG CACCGCCGGT GGCCGGCGAC 
ACCCCCACCC GCGCCGCGGC CAGGCCGCGC CGTCGCTCCG GCACCGGCCT GTCCGCCATG
GTGCTGCCCG AGCTGCAGGC CATGGCCTCC GGGCTTGGCA TCAGCGGGAC AGGGCGGATG
CGCAAGAGCC AGCTCATCGC GGCCATCCAG GAGAAGCAGG GCGTAGCACC CGAGAGCGCC
CCTGCTCCCG CTCCCGCCGC GGTCAAGGAA ACTCCGCCGG CCCCGGCCGT CGCGGCCGAG
CCGGTCGCCG AGCGCCCGGC CCGCAGCCGC AGGGAACGTT CCCGGCCCGC CGCCGTCGCC
GAACCCGTGG CCGAGCAGGC GCCCGTCGCG CCCGAGCCCG TGGCCGCAGC CGCTCCCGTC
GTGGAGCAGC AGGCCGATGC CGCTCAGGCC GACACCCGCG CCGAGCGCGG TGAGAGCCGC
CGTGAGCGTG GCGACCGCCG CAGCCGCAAC AGCGACCGTG GCGAGCGCAC CGGCGACCGC
CGCGACCGTG GCGACAGCCG TGGCGAGCGC GGTGACAACC GTGGCGAGCG TGGCGACCAG
CGCGCCGCCG ACGCGGGCCA GAACCGTGGC GAGCGCGCCG ACCGTGGCGA GCGCGCCAGC
GACCGTAACG ACCGTGGCGA GCGCGCCGAC CGTGGCGAGC GCGTCAGCGA CCGTAACGAC
CGCGGTGAGC GCGCCGACCG CGGTGACCGC GTCAGCGACC GCGGTGACCG TGGCGAGCGC
GTCAGCGACC GCGGCGACCG TACCGACCGT GGTGACCGCG GCGACCGTGG CCCGCAGAAC
CGTGGTCCGC AGGACCGCGG CGACCAGCAC GGCGTCGGCG AGGACGACGA CCGCCGTGGC
CGCCGTGGCC GGTTCCGGGA GCGGAACCGT CGCGGCCGTG ACCGCTTCGA CAGCAACGAG
CCCGTGGTCG GCGACGACGA CGTCCTGATC CCGATCGCCG GCATCCTGGA CATCCTCGAC
AACTACGCGT TCGTGCGGAC CAGCGGTTAC CTGCCGGGCT CCAACGACGT CTACGTCTCG
CTGGCCCAGA TCCGCCGTAA CGGCCTGCGC AAGGGTGACG TCGTCACCGG CGCCGTCCGC
CAGCCCCGCG ACGGCGAGCG CCGCGAGAAG TTCAACGCGC TCGTCCGCCT CGACACGGTC
AACGGCATGG ACCCGGAGCA GGCGCGGCAG CGGCCCGACT TCAACAAGCT CGTCCCGCTC
TACCCGCAGG AGCGGCTGCG CCTGGAGACC GAGCCGAACG TTCTGACCAC CCGGATCATC
GACATGGTGG CGCCGATCGG CAAGGGCCAG CGCGGCCTCA TCGTCTCCCC GCCCAAGGCG
GGGAAGACCA TGGTGCTCCA GGCGATCGCC AACGCGATCA CCCGGAACAA CCCCGAGTGC
CACCTGATGG TCGTCCTGGT CGACGAGCGT CCGGAAGAGG TCACCGACAT GCAGCGGTCG
GTGAAGGGCG AGGTCATCCA CTCGACCTTC GACCGTCCCG CCGAGGACCA CACCACGGTC
GCCGAGCTCG CCATCGAGCG CGCCAAGCGT CTGGTGGAGC TGGGCCACGA CGTCGTCGTG
CTGCTCGACT CGATCACCCG TCTGGGCCGG GCCTACAACC TGGCGGCCCC GGCCTCCGGC
CGGATCCTGT CCGGTGGTGT CGACTCCACC GCGCTCTACC CGCCGAAGCG CTTCTTCGGC
GCCGCCCGCA ACATCGAGAA CGGCGGCTCG CTGACGATCC TCGCCACGGC GCTGGTCGAG
ACCGGCTCCA AGATGGACGA GGTCATCTTC GAGGAGTTCA AGGGCACCGG AAACCTGGAG
CTCAAGCTCA ACCGCTCGCT CGCCGACAAG CGGATCTTCC CCGCGGTGGA CGTCGACGCG
TCCGGCACCC GTAAGGAAGA GATCCTCATG GGCAAGGACG AGCTGCAGAT CACCTGGAAG
CTGCGGCGCG TGCTGCACGC CCTGGACATG CAGCAGGCCC TGGAGCTTCT CCTGGAGAAG
ATGAGGGAGA CCAAGTCCAA CGCGGAGTTC CTCCTCCAGG TCCAGAAGAC GACGGTCAGC
TCCGACCGCG ACTGA
 
Protein sequence
MSDTTELLSD AAGAPPVAGD TPTRAAARPR RRSGTGLSAM VLPELQAMAS GLGISGTGRM 
RKSQLIAAIQ EKQGVAPESA PAPAPAAVKE TPPAPAVAAE PVAERPARSR RERSRPAAVA
EPVAEQAPVA PEPVAAAAPV VEQQADAAQA DTRAERGESR RERGDRRSRN SDRGERTGDR
RDRGDSRGER GDNRGERGDQ RAADAGQNRG ERADRGERAS DRNDRGERAD RGERVSDRND
RGERADRGDR VSDRGDRGER VSDRGDRTDR GDRGDRGPQN RGPQDRGDQH GVGEDDDRRG
RRGRFRERNR RGRDRFDSNE PVVGDDDVLI PIAGILDILD NYAFVRTSGY LPGSNDVYVS
LAQIRRNGLR KGDVVTGAVR QPRDGERREK FNALVRLDTV NGMDPEQARQ RPDFNKLVPL
YPQERLRLET EPNVLTTRII DMVAPIGKGQ RGLIVSPPKA GKTMVLQAIA NAITRNNPEC
HLMVVLVDER PEEVTDMQRS VKGEVIHSTF DRPAEDHTTV AELAIERAKR LVELGHDVVV
LLDSITRLGR AYNLAAPASG RILSGGVDST ALYPPKRFFG AARNIENGGS LTILATALVE
TGSKMDEVIF EEFKGTGNLE LKLNRSLADK RIFPAVDVDA SGTRKEEILM GKDELQITWK
LRRVLHALDM QQALELLLEK MRETKSNAEF LLQVQKTTVS SDRD