Gene Sros_4464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4464 
Symbol 
ID8667758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4975558 
End bp4978806 
Gene Length3249 bp 
Protein Length1082 aa 
Translation table11 
GC content76% 
IMG OID 
ProductLuxR family transcriptional regulator 
Protein accessionYP_003340075 
Protein GI271965879 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0122059 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCTTC GCGTCACGCT GCTCGGCACC TTTCAGGCCT CCCGTGGTGA CGCCGCCCTG 
CCCGTCCGGG GTGCGCGGTT GCAGGGTCTG GTCGTACGGC TGGCGCTCGC CGGCGGGCGC
GCCGTCGAGC AGGGCGTCCT GGTCGACGCG ATCTGGGCCG AGGACCCGCC GACCGGCCCC
GCCCACGCCC TGCAGGCCCT CGTCTCGCGG CTGCGCCGGA CCCTCGGCCC GGCCGGCGAC
ATCGCGCAGG TCGCGGGCGG CTACCGGCTG GAGGTGGACG CCGCCGACGT GGACGCACTG
CGGTTCGAGC GGCTCGCCGC CGCCGGCCGT GACCGGCTGC GCGCCGGCGA CCCGAATGCC
GCGGCGGTCG TGCTCGGCGA GGCCGTGGCA CTGTGGGGCG ACCGTCCCGG CACCGAGCCC
ACGGTCATCG CCGCGGTAGC GCCCGCCGCC GCGACCCGGC TGGCCCACGC CTCGCTCGAG
GCCGTCACCG ACCTCGCCGC CGCCGAGCTG ACCCTGGGCC GTGCCGACGC GGCCGCCGCC
CGCCTGAGCG CCCTGCTCGC CGAGCACCCC GTCCACGAGC GCGCCGCCGC GCTGCTCATG
GACGCGCTCA CCGCCCAGGG ACGCCAGGCC GAGGCCCTGG CCCTGTACGA GCGGGTCCGC
GAAACCCTGG CCGACGTCCT CGGCACCGAC CCGGGCTCCG CCCTGCGCGA GCGCCACGTG
CGCCTGCTGC GCGCCGAACG GCCCGCCCCG GCCGCCGACG CCGCCCAGAC CAGGCCGAAT
AACCTGCCCG CACCGCTGAC CAGCTTCATC GGGCGCGACG ACGATCTGGC CCGGATCGAC
ACGCTGCTCA CCACCGGACG CCTGGTCACC GTGCTCGGTC CCGGCGGCGC CGGCAAGACG
CGCCTCGCCG TCGAGGCCGC CCGCCGCCAC CGCCACGAGT ACCGCGACGG CGCCTGGATG
ATCGACCTCG CCTCGGTCAT CGAACCGGCC AAGGTCGGCG CGGCCGTGCT CGCCGGGATC
GGGCTGCGCG GCGGCGCGAT GTTCGAGGCC CGGGTGCGCA TCGAGGGCGA CGAACTGGAC
GTGCTCGCCG ACCAGCTCGG CGGCCGGGAG AGCCTGCTGC TGATCGACAA CTGCGAGCAC
CTGATCGACG CCGTGGCCCA CCTGGTCGCG GCACTGCTGT CCCGCTGCTG CGGGCTGCGC
GTGCTCGCCA CCAGCCGCGA GCCCCTCGCG GTCGACGGCG AGGCACTGGT GCCGCTCGGC
CCGCTCGCGC TGCCCGGACC GGACGACGGC GTCGAACAGG CCCGGCGGGC GGCGTCGGTA
CGCCTGTTCA CCGAGCGGGC CGCAGCCGTA CGCCCCGGCT TCGAGGTGGA GGAGACGACG
CTGCCCGAGG TCCTGCGGGT GGTCCGCGGC CTGGACGGGC TGCCGCTGGC GCTGGAGCTG
GCCGCGGCCC GGCTGCGGAC GCTGTCGCTG CCCGAGCTGG CCGACGGGCT GTCGGACCGG
TTCCGGCTGC TGACCACCGG CAACCGCACC GCGCTGCCCC GGCACCGCAC GCTGCGCGCG
GTCATCGCCT GGAGCTGGCA ACTGCTGAGC GAGCACGAGC GTACGGTCGC CGAACGCGTC
TCGATCCTGC CCGGCGGCGT CACGCCGGCC TCGGCCACCG CGCTCTGCGC CGGCACCGCC
GTACCGGCCG CCGAGATCCC CGAGCTGCTC GCCGGCCTGG TCGACCGATC GCTGCTGCAG
CTCGCGCCCG ACCCGGGCCG CTACCGCATG CTGGAGACGC TGCGCGAGTA CGGCACCGAC
CGCCTGGCCG CGACGGGCGA CCTCGCCACC GTCCGCGACC TGGCCGCCGA CCACTTCGCC
GAGCTGACGG CCCGCTACGA CCCGCAACTG CGCGGGCCCG GCCAGCTGAC GGCCATGCGA
GTCATCGGCG CCGAATACGA CAACGCGCTG GCCGCCCTGC GCAGACGGTG CGACACCGGC
GACGCCTCCG GCGCGGTCGC TCTCGCCCTG AACCTGACCT GGTACTGGCA CATGTTCGGC
CGGCACCCCG ACGCGGCCTA CTGGCTGGGC GAGGCACTGG CGGTGCCCGG CGGCGAGCCG
ACACCCGAGC GCGACTGCGC TCAGGTGATC TCCTTGATCG ACCGGGTCGG CACCTGGCCG
GGGAGGTCCG CCGAGGAGAC CGCGGACGAC CAGGCACGGA TACGCGAGCC GGCCGACCGG
CTGCTCACCT ACCCGCAGCT GCCGGGTCCG TACGGCGCGC TCACCGCGCT CACGCTCGCG
TTCCTGCAGG AGGAGGAGGC CGCGTTCGCG ATCATCGAGC GCCTGGCCGA CGGCGACGAC
GTCTGGTTGT CCGGGCTGGC CCGCATGTTC CGGGCCCAGT TCGCCGAGAA CGCGGGCGAG
CTCGACAGGA CGCGTCCCGA CGTGGAGGCG GCCCTGGCCT GCTTCCGGCA GGCCGGCGAC
CGCTGGGGCC AGGCCACCGT GCTGCCGATG CGCACCCAGC TACGGCAGTA CGACGACGAC
CTCGACCGCG CGCTGGCCGA CCTGCGCCAG GCCCGGTCGC TGGCCGGCGA GTTCGGCTCG
CTCAACCTCG GCGACGAGGT GTTCATGGAC GTGCGCTGGA TCGACCTGCA CCTGCGGCGC
GGCGACACCG ACCTGGCGAT CGCGGTCATC GGCCCGGCCC GGGAGCGGGC GCTGCGCGCG
AACTCGCCGG GGATGCCGGC CCTGGTCGAC GCGTGGGAGG CCGTCTTCCG GGTGCGGCTG
GGCGACCTGG ACCGGGCGCG GGAACTGCTC GACGACGCCG AACGGGGCCT GCGCGGCGAC
ACCACCTTCC CCGGTGGCCA CGCGCGGACG CTGGTCGGCG GCGTACGGGC CTCGCTCTGC
CTGGAGACCG GTGACCTGGC CGGCGCGCAG AAGGCGCTGG AGAAGGCGTA CGCGGCGGCG
CTGGAGACCC GGGACCTGCC GATCCTGTCG CTGGTGGCGG TGAACGCCGC CGCACTCGCC
GAGGCACACG GACGGCATCA GGACTCGGCC GTCCTGCTCG GCGCCGCCTC CCGGCTGCGC
GGCGCTCACG ACCACACCCA TCGGCAGGTC CGCGAGCTCA CCCGCCGAGG GCGGGCCGCG
CTGGGCGAGG AAGCCTTCGC CGCGGCGTAC GGGAAGGGGT GGAAGCTGGA CGGGAAGACG
GCCGTGACCG AAGTCGACCC GGCCCGGCTA CACCGGGAAC TCAGGACGGC TCACCCGGAT
CCCTGGTGA
 
Protein sequence
MRLRVTLLGT FQASRGDAAL PVRGARLQGL VVRLALAGGR AVEQGVLVDA IWAEDPPTGP 
AHALQALVSR LRRTLGPAGD IAQVAGGYRL EVDAADVDAL RFERLAAAGR DRLRAGDPNA
AAVVLGEAVA LWGDRPGTEP TVIAAVAPAA ATRLAHASLE AVTDLAAAEL TLGRADAAAA
RLSALLAEHP VHERAAALLM DALTAQGRQA EALALYERVR ETLADVLGTD PGSALRERHV
RLLRAERPAP AADAAQTRPN NLPAPLTSFI GRDDDLARID TLLTTGRLVT VLGPGGAGKT
RLAVEAARRH RHEYRDGAWM IDLASVIEPA KVGAAVLAGI GLRGGAMFEA RVRIEGDELD
VLADQLGGRE SLLLIDNCEH LIDAVAHLVA ALLSRCCGLR VLATSREPLA VDGEALVPLG
PLALPGPDDG VEQARRAASV RLFTERAAAV RPGFEVEETT LPEVLRVVRG LDGLPLALEL
AAARLRTLSL PELADGLSDR FRLLTTGNRT ALPRHRTLRA VIAWSWQLLS EHERTVAERV
SILPGGVTPA SATALCAGTA VPAAEIPELL AGLVDRSLLQ LAPDPGRYRM LETLREYGTD
RLAATGDLAT VRDLAADHFA ELTARYDPQL RGPGQLTAMR VIGAEYDNAL AALRRRCDTG
DASGAVALAL NLTWYWHMFG RHPDAAYWLG EALAVPGGEP TPERDCAQVI SLIDRVGTWP
GRSAEETADD QARIREPADR LLTYPQLPGP YGALTALTLA FLQEEEAAFA IIERLADGDD
VWLSGLARMF RAQFAENAGE LDRTRPDVEA ALACFRQAGD RWGQATVLPM RTQLRQYDDD
LDRALADLRQ ARSLAGEFGS LNLGDEVFMD VRWIDLHLRR GDTDLAIAVI GPARERALRA
NSPGMPALVD AWEAVFRVRL GDLDRARELL DDAERGLRGD TTFPGGHART LVGGVRASLC
LETGDLAGAQ KALEKAYAAA LETRDLPILS LVAVNAAALA EAHGRHQDSA VLLGAASRLR
GAHDHTHRQV RELTRRGRAA LGEEAFAAAY GKGWKLDGKT AVTEVDPARL HRELRTAHPD
PW