Gene Sros_4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4043 
Symbol 
ID8667337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4500940 
End bp4502454 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content79% 
IMG OID 
Productputative transcriptional regulator, PucR family 
Protein accessionYP_003339694 
Protein GI271965498 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0389127 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00142198 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACCGACT TGAGGGGGCG TGCGGGGTGG GCCACGCCGG TCACGGTGGC CGAGCTGGTC 
AACGCCGGCC CGCTGGCCGG GGCTCGGATG TACGGCACGG GGGAGAACCC GGTCCGCCAG
GTCCGGATCG TCGACGACCT GGCGGTGTTC GGGTCGGTGG TCCCGCACAC GGCGGTGGTG
CTGATCGGGG CCGCGGCCGG CGGCGGCTGG GCGGTGGAGA TGGCCATGCG CCGGGCGTGG
GAGCAGGCCG CCGCGTGCGT GATCGCCTCC TCGGCGGGCG GGGGCGCGGG GTCCGGTGAG
GTGCTGGCCG AACGGCTCGG CGTCACGTTG ATCGTCGTTG ACGAGGACCC GCTGGTGACG
GCGGTGCGGG TGGCGTCGGC GGCGGCCCGT CCGGAGGCGG CCAGGACCCA GCTGGTGGCC
CGCTGCGCGA CCAGGCTGGC GGAGGCGGGA TCGTCGGCCC GGCGGGTCCT CGGCGTGCTC
AACGCCGAGC TGTCCGGGAC CGCCGTGGCC TTCCTGGACC CGTACGGCTC GCACCTGGCG
GGCCGTCGCG GCCAGGGGCA CTCCCTGGCG GAGGTGGAGG TGCCCGACGC GGAGGGCAGG
CCGCTGGGGG TGCTCGTGGC CTACGGCTCC TTCCGCTCGC CGGGCTGGCC GTCCGTGGTG
AGCGCCGTCC TGGCCCTGGC GGCCGCCCCG CTGGCCGCGT GGGCCGCCAC CGGACGCCTG
GCCGCCGAGC GGGACACGGC GCTCCAGTCG GCCCTCGCCG CGCGGCTGCT GGCCCGGGCC
ACGCGGCCGG GGGCCTTCAC GGAGCCCGCC GGTGCGCCGG AGGCCGCCCC GGCGGGCGGC
GGCCCGGCCG GTGCGGCGGC GGAGGAGGGG GCGGACGGAG TGCTCGGGCG GGCGGTCGCC
CTGGGCTGGC CGGTCACCGG TCCGCTCACC GGCTACGCCG TCCGGCCCTT CGACGACGGG
TGGGAAGGGG CAGGGGTGAT CGGGCCGGTC ATCGCCGCCT CCCTGGGGCC CGGTCCCGTG
CTGCGCCGCG GCGGGAGCTG GGCGGGCTGG TCGGGGCTGC CGCCGGACCG GCTCGCCGGA
CGCCTGGCCG AGTGCCTGCG GGCCCTGCCC GCGCCCTGCT CGGCGGGCGT CGGCGCCCAG
GCGGCCGACC TGAGCGGTAT GGAGGAGTCG CTGCTCGGGG CGGAGGCCGC CGCCCTCGTC
TCCCCGGCGG GCGTGGTCGC CCGCGCCGAC CGGCTGGGCC CGGCCAGGCT GCTGGCCGCG
CTGCCCTCCG GAGTCCTCCG CGCCCAGGCG CGGGTGATCC TCGGGCCGCT GCTGGCGGTG
GACCGGGAGG GGACGCTGCT GGAGACCCTG GCCGCCGTCC TGGACGAGGG AGGGGCCTCG
CGGGCGGCCG ATCGCCTCGG CGTCCACCGC AACACCGTTA CCACCCGCCT CGACCGGATC
CGCGCGGCCG GGTTCGACGT CGACGACCCG GCGACCCGGC TCGCCCTGCA CCTGGCCTGC
CACGTGCTGC GCTGA
 
Protein sequence
MTDLRGRAGW ATPVTVAELV NAGPLAGARM YGTGENPVRQ VRIVDDLAVF GSVVPHTAVV 
LIGAAAGGGW AVEMAMRRAW EQAAACVIAS SAGGGAGSGE VLAERLGVTL IVVDEDPLVT
AVRVASAAAR PEAARTQLVA RCATRLAEAG SSARRVLGVL NAELSGTAVA FLDPYGSHLA
GRRGQGHSLA EVEVPDAEGR PLGVLVAYGS FRSPGWPSVV SAVLALAAAP LAAWAATGRL
AAERDTALQS ALAARLLARA TRPGAFTEPA GAPEAAPAGG GPAGAAAEEG ADGVLGRAVA
LGWPVTGPLT GYAVRPFDDG WEGAGVIGPV IAASLGPGPV LRRGGSWAGW SGLPPDRLAG
RLAECLRALP APCSAGVGAQ AADLSGMEES LLGAEAAALV SPAGVVARAD RLGPARLLAA
LPSGVLRAQA RVILGPLLAV DREGTLLETL AAVLDEGGAS RAADRLGVHR NTVTTRLDRI
RAAGFDVDDP ATRLALHLAC HVLR