Gene Sros_4296 details

Gene Information

Locus tag: Sros_4296
Symbol:
ID: 8667590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4796436
End bp: 4799426
Gene Length: 2991 bp
Protein Length: 996 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003339928
Protein GI: 271965732
COG category:
COG ID:
TIGRFAM ID:
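
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the numbers only agree under that convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a complete CDS: codons minus one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4796436, 4799426)
print(length)                     # 2991
print(protein_length_aa(length))  # 996
```

Both values match the Gene Length and Protein Length fields of this record.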


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.300157
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGCTCG GAGGCATGCG CCAGCAACGT GTCATGGCCA CGCTACTGCT GGAATCGAAC 
AGGATCATTC CCGTCTCCCG CCTGGTCCAG GCGACCTGGG GGGAGGCGGC CCCCGCTACA
GCGCACCATC AGATCCGGAA AATGGTCGCC GATCTACGCA ATCGCCTGGG CGGTGGCGGG
CAGAGCATCG TGACGGACGG CCCCGGCTAT CGCCTGGTGG TCGAAGAGGA CCAGTTGGAT
CTGAGCCTTT TCGATATCCG TACACGGCGA GCCGGCGAGG CCGAGAAGTC CGGACTACCC
GTCAATGCGA TGACGGAGCT GCAGGAAGCG CTCGACCTGT GGACGGGGCA GGCGCTGAGC
GGAATCGACG GCCCGGTGAT CCGGGCGGCT GCCGAGTTGC TCGACGAACG CCGGGTCGCT
GTGACGGAGC GGCTGATGGA TCTCCGTCTT ACCCATGGTC ACGCGCGTGA TCTGATTTCC
GACCTGCGTG CCCTGGTGTC CGAGCATCCA TTACGGGAAA CGCTGTGGAC CCAGCTGATA
TTGGCCCTGT ACCGGTCGAA CCGTCATGCT GAGGCTCTGC GGACCTACGA AGACGTCCGC
TGCCTCCTGG CCGACGAGCT CGGGGTCGAT CCGGGGATCG AGTTGATGAA GTTGCATGAG
CTGATCCTCA GAAACGATCC GACGCTGGAT TGCCCGTCCC GCCGGACGAA GAGCGGCATC
GGCGTTCCCG TGATGGTCAG ACCTTGCGCC CTGCCGTACG ACGTTCCCGA CTTCACAGGT
AGGGAGGCCG AGCTCCAAAG GCTGACGAGT ATGGTGGCCG AGAACGCCGA TCGGACGGCA
CTGATCGTCA CCATCGACGG GATGGCGGGT ATCGGCAAGA CCGCATTCGC GGTCCACGCC
GCCCACAAGC TCGCTCGGCT CTTCCCCGAC GGCCAGCTCT TCGTGGACTT CAACGGCTTC
ACACCGGGTC GGAAACCCAT CCCGGTGGCC GAGGCGATCG CCACCCTGCT CTCCACCCTG
GGAGTTCCGG ACGGCGAGAT ACCGCATGAC CTGCAAGGCA GGATCGCGAT GTGGCGCATG
AGGACGGCCG GCCGGCGCCT GTTACTCCTG CTCGACAACA CGGCGGACGC GGCTCAGGTG
CTTCCGCTCC TGCCCGGCAT GCCGGGGTGC GTGACGCTGA TCACCAGCCG TGCGCCCCTG
TCCGGAGTGG ATGGCGCGGT TCTCCAGTCT CTGGAGCTTC TCACCCCGGA CGAGAGCCGT
GCGCTCCTGG AGCGCGTAGT CGGGACGATG CGGCTGGCCA CCGAGAGAGA GACCGTAACG
ACGCTGATAG AGATGTGCGG GCGCCTGCCC CTTGCCCTGC GTATCGTCGC CGCCCGGCTC
AACAACCGTC CACAGTGGAG TGTCGCCCAC ATGGTCGACC GCCTGGGGAA CGAACGGCGG
CGGCTGAGCG AGCTGGTTGT GGGGGACCGG AGCGTCCACG CGGCGATCGC GCATTCCTAC
GGTTCGCTGA GGCCGGATCA GCAACGGCTC TTCCGCCTGT TGGGGCTGCA CCCCGGGCAT
GACTACGACG CCTATGCCGC TGCGGCGCTG GCGGGGATAC CTGTCGACGA GGCGGAGTCG
TTACTCGAGG GTCTACTCGA CTCCAGGCTT CTGGTCCAAC GCGAGGTCGG TCGCTACAAC
TACCACAACC TGGTCCTGAG TTATGCCCGC GACGCCGCAT CCTCCCACGA AACCGAGCAC
ACCGGCACGC GTGCGATCCA TCGCCTCCTC GACTACTACC TGCATACGGC GGAGGCGGCC
GCGAACCTCC TGGATCTCGG CCGCCGGCAG GTAGCGCTCT GCCTGAGCTG CCCTCCGTCG
CATGTTCCGC CCTTGGCCGA CGGGAAAGCA TCCCTGCGGT GGTTCGACAC CGAGCGCCAC
AACCTCCGCA TGGCGGTGGA AGAGGCCCAG CGCCAGGGCC TGCATGGTCA CTGCTACCAC
CTGCCTCATA TGATGGCGCA CTACCTCCAG CTCAGGGGGT GTTCCGACGA TCAGCTGACT
CTCCTGAAGA CGGCTTTCGA CGCCGCCGCC CAGCTCAATG ATCACCATGC CCAGGGGCGC
ACTCTGCTCA ACCTCGCCGT GTCGGACTGG TTCTACGGCA GGTTCAGCGA GGCTCTCGAC
AGGGCTGTTC AGGCCCTGGC CGTGGCCGAG AAGCTGGGAG ATGCGGATTT CCAGGGTGCC
TGCCTGAGCC GGATCGGTAT CTTCCACGCG GCCTTGGGAC GCTACGAGGA GGCCCTGCAC
CACTACAAAT GCGCGCTGGA CATCCACGAA GGTCATGGGA ACTGGAGCGA GGTGAGCATG
ACGCTGGTGA GCACGAGCTC GACCAAGGCA ACGCTCGGGC TCTTCGATGA CGCGCATCGT
GACGCCTGTC GCGCGGTCTC CATCGATCGG CGTTCCGGCT ACCGGAACGG AGAAGTCATG
GGGCTGCTCG CCATGTCAAC CGCCCAAGCG GGTATGAACG ATCTCGAAGG AGCGGCCTCG
TCCCTGAACA CCGCGTACGA GCTCGTGCAC ACGGACGAGA TGCCCGGTTA CGAGGCCGCC
GTTCTCGTGC AGCAGGGCCA CCTGCATCGA CGCCTCGGGC AACTGGACGA GGCTACGGCG
GCCGGCCGGC GCGCCATGGA AATCCTCGGC ACGACCCGGC GCTCCGTCAC CGCCATTCGC
GGGCAGAACC TCCTCGGTTC GATCGACTGT GATCGTGGTG ACTACATCCT CGCTTTCGAA
CGGCACAGGT ACGCCCAGCG GCTCGCTCTG CGCTCGGGCC ATCGCCTCGA GACCGCGCGT
GCGCTCGACG GGATCGCTCG GGCCTTGGCC GGGCTGGGCC AGTGTGAGGC GGCGAGGACG
GTGTGGCGAG AAGCCCTCGA TCACTTCGAG GAGATGGGGA CGCACGAGGT GTTCGCGGTG
CGCAGGAGGC TGGCCCGACA GGTGAGCAGC GTACCCGCGC CGCATCCGTA G
 
Protein sequence
MPLGGMRQQR VMATLLLESN RIIPVSRLVQ ATWGEAAPAT AHHQIRKMVA DLRNRLGGGG 
QSIVTDGPGY RLVVEEDQLD LSLFDIRTRR AGEAEKSGLP VNAMTELQEA LDLWTGQALS
GIDGPVIRAA AELLDERRVA VTERLMDLRL THGHARDLIS DLRALVSEHP LRETLWTQLI
LALYRSNRHA EALRTYEDVR CLLADELGVD PGIELMKLHE LILRNDPTLD CPSRRTKSGI
GVPVMVRPCA LPYDVPDFTG REAELQRLTS MVAENADRTA LIVTIDGMAG IGKTAFAVHA
AHKLARLFPD GQLFVDFNGF TPGRKPIPVA EAIATLLSTL GVPDGEIPHD LQGRIAMWRM
RTAGRRLLLL LDNTADAAQV LPLLPGMPGC VTLITSRAPL SGVDGAVLQS LELLTPDESR
ALLERVVGTM RLATERETVT TLIEMCGRLP LALRIVAARL NNRPQWSVAH MVDRLGNERR
RLSELVVGDR SVHAAIAHSY GSLRPDQQRL FRLLGLHPGH DYDAYAAAAL AGIPVDEAES
LLEGLLDSRL LVQREVGRYN YHNLVLSYAR DAASSHETEH TGTRAIHRLL DYYLHTAEAA
ANLLDLGRRQ VALCLSCPPS HVPPLADGKA SLRWFDTERH NLRMAVEEAQ RQGLHGHCYH
LPHMMAHYLQ LRGCSDDQLT LLKTAFDAAA QLNDHHAQGR TLLNLAVSDW FYGRFSEALD
RAVQALAVAE KLGDADFQGA CLSRIGIFHA ALGRYEEALH HYKCALDIHE GHGNWSEVSM
TLVSTSSTKA TLGLFDDAHR DACRAVSIDR RSGYRNGEVM GLLAMSTAQA GMNDLEGAAS
SLNTAYELVH TDEMPGYEAA VLVQQGHLHR RLGQLDEATA AGRRAMEILG TTRRSVTAIR
GQNLLGSIDC DRGDYILAFE RHRYAQRLAL RSGHRLETAR ALDGIARALA GLGQCEAART
VWREALDHFE EMGTHEVFAV RRRLARQVSS VPAPHP
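
The protein sequence can be reproduced from the gene sequence with a small translation routine. The following is a minimal sketch, not the tool that generated this record. It assumes the sequence is given in coding orientation and is a complete CDS, so the first codon (here GTG, a permitted alternative start codon in translation table 11) is read as methionine. The amino acid assignments in table 11 are the same as in the standard genetic code. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the genetic code table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna: str) -> str:
    """Translate a CDS given in coding orientation.

    Whitespace (the 10-base grouping used in this record) is ignored.
    Assumption: the input starts at the initiation codon, so the first
    residue is forced to M even for alternative starts such as GTG.
    Translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


first_line = "GTGCCGCTCG GAGGCATGCG CCAGCAACGT GTCATGGCCA CGCTACTGCT GGAATCGAAC"
print(translate_cds(first_line))  # MPLGGMRQQRVMATLLLESN
```

The output for the first line of the gene sequence matches the first twenty residues of the protein sequence listed above.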