Gene Sros_5001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5001 
Symbol 
ID8668295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5526475 
End bp5527506 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content74% 
IMG OID 
ProductLacI family transcription regulator 
Protein accessionYP_003340543 
Protein GI271966347 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.282758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.452434 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCCGAC CGACGATCGC CGACATCGCG GAGCGGGTCG GGGTTTCGAA GGGCGCGGTG 
TCCTTCGCGC TCAACGGGCG CCCGGGGGTC GGCGAGGCCA CCCGCGCGCG AATCCTGCAG
GTCGCGAAGG AGATGAACTG GCGTCCGCAC AGCGCCGCCC GGGCTCTGGG CGGGGCCCGC
GCGGAGGCCG TGGGCCTGGT CATCGCCCGG CCCGCCAGCA CCCTCGGCGT GGAGCCGTTC
TTCGCCCAGT TGTTGTCCGG GCTGCAGGCG GGGCTGTCGG CGCAGTCGGT GGCGCTGCAC
CTGCTGATCG TCGAGGACGT CGAGGCCGAG ACGGAGGTCT ACCGGGAGTG GGCCTCCGCG
CACCGGGTGG ACGGGTTCGT CATGGTGGAC CTCAAGGTCC GCGACCCGCG CATCGAGGTG
CTGGAGGATC TGGGGGTGCC CGCGGTGGTC CTGGGCGGCC CCGGCGGGCA CGGAAACCTG
TCCAGCGTGT GGGCCGACGA CCGCGGGGCG ATGCTGTCGG TCGTGGATTA CCTGGCCGCG
CTCGGCCACC GCAGGATGGC CCACGTCGCG GGCCTCCCGG CGTTCCTGCA CACCCAGCGG
CGCACGCGCG CGCTGCGCGA CCGCGCCGGG CGGCTGGACC TGGAGGAGGC GGTGTCGGTG
CACACCGACT TCAGCGACTC CGAGGGCGCC GCCGCGACCC GGGCGCTGCT GTCGCGCTCC
CGCCGGCCGA CGGCCATCGT CTACGACAGC GACGTGATGG CGCTGGCCGG GCTGGGGGTG
GCGGGCGAGA TGGGGGTGAA GGTGCCGGAC GAGCTGTCCA TCGTGGCCTT CGACGACTCG
GTGCTGACCC GGATCGCGCA CCCCGCGATC AGCGCGCTCT CCCGCGACAC CTTCGCCTTC
GGCAAGCAGT TCGCCGAGGT GATGCTGGAG GTCATCGCCG AGCCGGGGCT GCGCCGGGAC
GTCGAGACGG CGACCCCCCG GCTGGTGGTC CGGGAGAGCA CGGCGCCGCC GAGGGCTTGG
CAGGACGACT AA
 
Protein sequence
MRRPTIADIA ERVGVSKGAV SFALNGRPGV GEATRARILQ VAKEMNWRPH SAARALGGAR 
AEAVGLVIAR PASTLGVEPF FAQLLSGLQA GLSAQSVALH LLIVEDVEAE TEVYREWASA
HRVDGFVMVD LKVRDPRIEV LEDLGVPAVV LGGPGGHGNL SSVWADDRGA MLSVVDYLAA
LGHRRMAHVA GLPAFLHTQR RTRALRDRAG RLDLEEAVSV HTDFSDSEGA AATRALLSRS
RRPTAIVYDS DVMALAGLGV AGEMGVKVPD ELSIVAFDDS VLTRIAHPAI SALSRDTFAF
GKQFAEVMLE VIAEPGLRRD VETATPRLVV RESTAPPRAW QDD