Gene Sros_5231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5231 
Symbol 
ID8668525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5747197 
End bp5748597 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content73% 
IMG OID 
Productputative transcriptional regulator, GntR family 
Protein accessionYP_003340743 
Protein GI271966547 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.203083 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCAGCC CCGGCGCGGA AGGCGGCAGC GACCTTCATC TGGAGCTCGC CGGCGTCGGC 
GGCGCGCGCG TGCGGTTGAT GCGGGCCCTG CGCGAGGCGA TCCACTCGGG ACGGCTGGCG
CCCGGTACCC GGCTTCCGCC CTACCGCACC CTCGCCCTCG ACCTGGGAAT CGCCCGCAAC
ACCGTCGCCG CCGCGTACTC CGAACTCGTC GAGGAAGGCT GGCTCACGTC CCGTCAGGGT
TCCGGCACGC ACGTCGCGCA CCGCGCCGCA CCTCTGGAAC CCGGCCGCCG GCGCTCCCCC
GCGCGTCCGT CCCGCCTGCG GATCATCCAC GACCTGATGC CAAGCTCCCC TGACGCGGCC
GCCTTCCCGC GCTCGTCCTG GAGCGCCTCG GCCCGGCGCG CGCTGGCAGG CGCGCCCAAC
GACGCCTTCG GCGTCGGCGA TCCCCGTGGC CGGCTGGAGT TGCGCCAGGC GCTGGTGGAA
TACCTGGCGC GGACCAGAGG CGTGCGCACC AAGGCGGACC ACGTCGTGAT CTGCTCGGGC
TTCGCGCACG GGCTGCGGCT GGTGTGCCAC GTGCTGCGCG GCAGCATCGC GGTCGAGTCG
TACGGACTCG ACTTCCACCG CTCCATTCTC ACCGAGGCGG GACTGAAGAC CGTGCCGCTC
ACCGTGGACA TCCACGGCGC ACGGATCGAG GACCTTCCGG CCACGGGCGC GCAAGCCGTC
CTGCTGACGC CGGCCCATCA GTACCCCACG GGGGGAGCCC TGCACCCGCA GCGCCGGGCC
GCGGTCATCG ACTGGGCACG CGCCACGGGG GGTCTGCTGC TCGAGGACGA CTACGACGGC
GAGTTCCGTT ACGACCGCGA GCCGGTCGGC GCCGTGCAGG GGCTCGATCC GGACCGCGTC
GTCTACTTCG GCTCGACGAG CAAGAGCCTG TCGCCCGCGC TCCGGCTCGG CTGGATGGCG
TTGCCCGGCC GGCTGGTGGA CGACATCCTG GCCGCCAAGG GCTCGCGCGA ACTCTGGTCG
GGCGTCATGG ACCAGGTCAC CCTCGCCGAC TTCGTCGCCG GCGGCGCCTA CGACCGTCAG
CTGCGGCGGA TGCGCGGGAT CTACCGGCGC CGCCGTGACC TGCTCACGGC CATGCTCGCC
GAACGCGCCC CGCACATCAC CGTCAGCGGC ATCGCCGCGG GCCTGCACGC CGTCCTCGAG
CTTCCCCCGG GCACCGAGCA GGCCGCTCTC CGCGCCGCGC GCCGCCTCGG CATCGCTCTG
GACGGCTTGG GCCCCTATCT GCACCCCGGC AGCACGATGC CGCCCCGCGA CGGCCTGGTC
ATCGGATACG GCACACCACC CGAGCACGCG GTCACAGCCG CCCTGGAAGC CCTGTGTCTC
GCCTTGCCCG ACCCTCCGTA G
 
Protein sequence
MSSPGAEGGS DLHLELAGVG GARVRLMRAL REAIHSGRLA PGTRLPPYRT LALDLGIARN 
TVAAAYSELV EEGWLTSRQG SGTHVAHRAA PLEPGRRRSP ARPSRLRIIH DLMPSSPDAA
AFPRSSWSAS ARRALAGAPN DAFGVGDPRG RLELRQALVE YLARTRGVRT KADHVVICSG
FAHGLRLVCH VLRGSIAVES YGLDFHRSIL TEAGLKTVPL TVDIHGARIE DLPATGAQAV
LLTPAHQYPT GGALHPQRRA AVIDWARATG GLLLEDDYDG EFRYDREPVG AVQGLDPDRV
VYFGSTSKSL SPALRLGWMA LPGRLVDDIL AAKGSRELWS GVMDQVTLAD FVAGGAYDRQ
LRRMRGIYRR RRDLLTAMLA ERAPHITVSG IAAGLHAVLE LPPGTEQAAL RAARRLGIAL
DGLGPYLHPG STMPPRDGLV IGYGTPPEHA VTAALEALCL ALPDPP