Gene Sros_3343 details

Gene Information

Locus tag: Sros_3343
Symbol:
ID: 8666631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3660225
End bp: 3661274
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: LacI family transcription regulator
Protein accession: YP_003339025
Protein GI: 271964829
COG category:
COG ID:
TIGRFAM ID:
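
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields.
start_bp = 3660225
end_bp = 3661274

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1050
print(protein_length)  # 349
```

Under these assumptions both values agree with the listed Gene Length (1050 bp) and Protein Length (349 aa).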


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.716413
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0750883
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTATA GCGGCGGCCC GCCGGGCAAC GGCGCGGCCC GGCTGACCGA CATCGCCGCG 
CAGGCGGGGG TGAGCGAGGC CACGGTCAGC CGGGTGCTCA ACGGCAAGCC GGGCGTCTCG
GCCGTCACCC GGCAGGCCGT CCTGGCCGCG CTGGACGTCA TGGGCTACGA GCGGCCGCAG
CGGCTGCGCC AGCGCAGCAA CGGGCTGATC GGCCTGGTCA CGCCGGAGCT GGACAACCCG
ATCTTCCCGG CGTTCGCCCA GGCCTTCGAG AAGGCGCTGA CCCAGCACGG CTACACCCCG
CTGCTGTGCA CCCAGCTCCC CGGCGGAGCG GTGGAGGACG AGTTCACCGA GCTGCTCGTG
GAGCGCGGGG TCAGCGGCAT CATCTTCGTC TCCGGGCTGC ACGCCGACAT CACTGCGCGC
TCCGACCGCT ACACCCAGCT CATCGGGCAG GGCGTGCCCA TCGTCCTGCT CAACGGGCAC
GCCGGCGACG TCCCGGCGCC GTTCATCTCC CCGGACGACC GGGCCGCCGC GCGGCTGGCC
GTACAGCATC TGGTGGATCT CGGGCACGAG CGGATCGGCC TGGCCGTCGG CCCCGGCCGG
TTCGTGCCGG TGATCCGCAA GATCGAGGGT TACCGGCAGG CGATGGCGCA GTTGCTGGGG
GCGGGCGAGG TGGATGAGCT GATCTCGCAT TCGCTGTTCT CGGTTGAGGG GGGTCAGGCG
GCGGCGGCGC AGTTGCTGGA GCGGGGGTGC ACGGGCATCG TGTGCGCCTC GGATCTGATG
GCGCTGGGGG CGATCCGGGC GTGCCGGGAT CGGGGGTTGT CGGTTCCGGC GGACGTGTCG
GTGGTGGGGT TCGACGACTC GCCGCTGATC GCCTTCACCG ACCCGCCGCT GACCACCGTG
CGCCAGCCCG TCCAGTCGAT GGTGACCGCC GCGGTGCACA CCCTGCTGGA GGCCGTCTCC
GGCGCGCCCA TGCAGCACTC CGAGCTGATC TTCCAGCCGG AGTTCATCGT GCGGGGCTCG
ACGGGCTCGG GCCCGAAGAT CCTCCGCTGA
 
Protein sequence
MPYSGGPPGN GAARLTDIAA QAGVSEATVS RVLNGKPGVS AVTRQAVLAA LDVMGYERPQ 
RLRQRSNGLI GLVTPELDNP IFPAFAQAFE KALTQHGYTP LLCTQLPGGA VEDEFTELLV
ERGVSGIIFV SGLHADITAR SDRYTQLIGQ GVPIVLLNGH AGDVPAPFIS PDDRAAARLA
VQHLVDLGHE RIGLAVGPGR FVPVIRKIEG YRQAMAQLLG AGEVDELISH SLFSVEGGQA
AAAQLLERGC TGIVCASDLM ALGAIRACRD RGLSVPADVS VVGFDDSPLI AFTDPPLTTV
RQPVQSMVTA AVHTLLEAVS GAPMQHSELI FQPEFIVRGS TGSGPKILR
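
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own procedure: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon-to-amino-acid mapping is the standard one, which translation table 11 shares for internal codons (table 11 differs only in the start codons it permits, and this gene starts with ATG). Only the first and last 30 nt of the gene are used as a demonstration; pasting the full sequence works the same way, since whitespace is stripped.

```python
# Standard codon mapping in NCBI order (bases T, C, A, G); "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last 30 nt of the gene sequence listed above.
first_block = "ATGCCGTATA GCGGCGGCCC GCCGGGCAAC"
last_block = "ACGGGCTCGG GCCCGAAGAT CCTCCGCTGA"

print(translate(first_block))  # MPYSGGPPGN
print(translate(last_block))   # TGSGPKILR
```

The two translated fragments match the first ten and last nine residues of the protein sequence, and the final codon TGA is the stop. Calling `gc_percent` on the full gene sequence would give a value to compare with the listed GC content.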