Gene Sros_9294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_9294 
Symbol 
ID8672642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp10246809 
End bp10249820 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content75% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003344655 
Protein GI271970459 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.705577 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTTC ACGGCAGCGA AGACGGGTTG CGCTTCGCCG TGCTCGGCCC TGTACAGGCA 
TGGCGTGACG GCACCGAACT CGATCTGGGC ACCCCCCTGC AACGTTCCAT TCTGGCGATG
CTGCTCCTGC GCGAGAGCCG GGCGGTCACC CCGGCCGAGA TGATCGACGC GGTCTGGGGG
GAGGACGCCC CGCCGCGCGC GCTCGGCGCG CTGCGCACCT ACGTCTCCCG GCTGCGGGCC
GTGCTGGAGC CCGGCAGATC CCCTCGCACC CGCCCCGAGC TGCTCATCTC GGTGGGCCGG
GGCTACGCGC TGCGGCTCGG CGGCGTCCTG GACCTGACCC TGTTCGATCG GGGCGTCCAG
GAGGCCGAGG CCGCCCGACG CGCCGGAGAC CGCGCCGGGG CCGCCGAGAG CCTGCGCGCG
AGCCTGGCGC TGTGCACCGG TGAGCCGCTG GCCGGGGCCG TCGGCCCCTA TGCCGAGCAT
CAGCGCGACC GGCTCGTCGA GCGCCGGATG AGCGTGCTGG AGACCCTGAT GGACCTGGAC
CTGGAGCTGG GCCGGCACGC CGACGTCGTC TCCGAGCTGA TCGCGCTGAC CGCCGACCAC
CCGCTCCGCG AGCGGCTCCG CGCCCAGCTG ATGCTGGCCT ACTACCGGTG CGGGCGGCAG
GCGGACGCGC TCGCGACCTT CGCCGACACC CGTAACGCGC TGATCGAGGA GCTCGGCATC
GAGCCGGGAC CCGACCTGGC CGCCCTGCAC CAGCGCATCC TCACCGGCGA TCCGAGCCTC
GCCCCCGCCT CGCCCGCCGC CGCCCCGGTA CGGCTCGCGC CCGCCGACCC CCCGGAGCCC
GCCCGGCAGC CGGAGGCCCC CGAGCCCGGC GCTCCCGAGC TGCCGCGCCC GGCGCAGCTC
CCCGCGGCGG TGAACGACTT CACGGGCCGC CGCCAGATCA TCGCCCGGCT GTGCACCCTG
CTGTCCACCC AGGGCAGCGG CGACGGCGTG CCGGTGGCCG CCATCTCCGG CATCGGGGGC
GTCGGCAAGA CGACCCTGGC GGTGCACGTC GCCCACGCCC TGCACGACCT GTTCCCCGAC
GGCCAGCTCT ACGCCGACCT GCGCGGATAC GGCGAGGAGC CGGTCGCGCC CGAGTCCGTG
CTGGCGGCGT TCCTGCGCGC CCTCGGCCTG CCCGCCGACA TCATCCCGGA CGGCCTGGCC
GAGCGGTCGG CACTGTTCCG CTCGCTCCTG ACCGACCGGC GGATGCTGGT GCTGCTCGAC
AACGCCAGGG ACGCCGCCCA GGTCAGCCAC CTGCTGCCGG GGTCGACCGG CTGTGCGGCG
ATCGTGACCA GCCGCGGCAA GCTCGCCGAC CTGGCCGCGG CCCGGCTGGT CGACCTGGAC
GTCATGGAGC CGGAGGAGGC GCTGACCCTG TTCGGCACGG TCGCCGGCGC CGAACGGGTG
GCCGCGGAGC GTGCCGCCGC CATGGACGTC GTCGCAGCCT GCGGCTTCCT GCCGCTCGCG
GTGCGGATCG TGGCGGCGCG CCTGGCCGCC CGGGCCTCCT GGACGGTCGC CTCGCTGGTG
CCCCGGCTGG CCGACGAGCG CCGCCGCCTC GACGAGATGC GCGTGGGCAA CCTCGCGGTG
GAGGCGACCT TCGCCCTCGG TTACGGACAG CTCAGCCCGG CGCAGGCGCG GGCGTTCCGG
CTGCTCTCCC TGCCGGGCGG CCCGGACATC TCGGCCGGGG CCGCCTCCGC GCTGCTCGCG
CTGAGCCCGA TGGACACCGA GGACATCCTG GAGTCCCTGG TGGACGCCAG CCTGCTGGAG
GCTCCCGCCC CCGGCCGCTA CCGCTTCCAT GACCTGCTCA AGCTCTTCGC CCGCCGTACC
GGCGAACGGG CCGAGGGGGG CGCGGGGCGG GGCGGGGAGG GGGCGCCCGC GCTGCGCCGG
CTGCTCGACT TCTACCTGGC CTCCGCGCGG TCGGCGCACC GGCTCGCCTA CGAGGGCAGC
ACGGTCGCCG ACCAGCTCGC GGCGGCCGGG CCGGGTCACG CGTTCGGCTC CGCCGACGAG
GCCGTGGCCT GGCTGTCGGT CGAGGCGGAG GCGCTGTTCG CGTCGATCGC CCAGGCGAGC
GGCGCCGAGG AGGCCGGGGC CCTGCTGCCC GGCGCCGACC TGCTGCTCGC CATGGAGCCG
CTGCTGGAGT CGGGCAGCCA CGCCCGGGAG TTCGACCAGC GGGCCAAGGA GGTGCTGGCC
GCCGCCCGGC GGCTCGGCGG CACCTCCAGC GAGCTGCGCT GCCGCTACGT GCTGGGCAGG
GTGCTGTTCA ACGCCAACCG GCTGGCCGAG GCGGAGAGCG AGTTCCGGGC GTCGCTGGAC
CTGGCCGCCG GGGGCGACCG GATCGTCACC GGCGAGGCGA TGAACGCGCT GGCCGTCGTG
GCCGGGCGTC AGCGCCGCCA CGTGGAGGCG CTGGCCTGGT TCGACTCGGC GCGCAGGGTG
TTCAGGGAGG TCGGGGCGCG GGGCGGCGAG GCGCTGACGC TCAGCTACTC CGCCCGTGAC
CACCTGTTCC TGGGCCAGCA GGAGGAGGCG ATCGCCGCCG CGGAGCAGGG CCTGGCCGTC
TTCATCGAGC TCGGCTTCAG CGCCGGCACC GCCAGGGCCC GCTACCACCT GGGGATGATC
CTGTCGCGGG TCGGGCGGCT CAACGAGGCG GTCCACCATC ACGCCGAGTG CCTGGCCTTC
TTCCGGGCCA GCAAGCAGCG GGTGTGGGAG CAGCGGGTCT GCTCCCGGCT GGCCGAGACG
TTCATCACCG CCGGCCGGTT CTCCGACGCG ACCCGCCACG CGGAGCAGGC CCTGACGGTC
AGCCGCGAGA TCGGTCATCC GTACGGCGAG GCGCTGTCCC TGTGGGTGCT AGGCAGGGCG
CTCGCCGGGC TCGGCAGCAC GGGCCGCAGC CACGACTGCC TCGAACGCGC CCATGACATC
TTCGTCAGGC TCGGCGCCCC CGAAGCCGCG GACCTGCGCG CCCTGCTCGA TCGTGATCAC
GTCGGCAACT GA
 
Protein sequence
MAVHGSEDGL RFAVLGPVQA WRDGTELDLG TPLQRSILAM LLLRESRAVT PAEMIDAVWG 
EDAPPRALGA LRTYVSRLRA VLEPGRSPRT RPELLISVGR GYALRLGGVL DLTLFDRGVQ
EAEAARRAGD RAGAAESLRA SLALCTGEPL AGAVGPYAEH QRDRLVERRM SVLETLMDLD
LELGRHADVV SELIALTADH PLRERLRAQL MLAYYRCGRQ ADALATFADT RNALIEELGI
EPGPDLAALH QRILTGDPSL APASPAAAPV RLAPADPPEP ARQPEAPEPG APELPRPAQL
PAAVNDFTGR RQIIARLCTL LSTQGSGDGV PVAAISGIGG VGKTTLAVHV AHALHDLFPD
GQLYADLRGY GEEPVAPESV LAAFLRALGL PADIIPDGLA ERSALFRSLL TDRRMLVLLD
NARDAAQVSH LLPGSTGCAA IVTSRGKLAD LAAARLVDLD VMEPEEALTL FGTVAGAERV
AAERAAAMDV VAACGFLPLA VRIVAARLAA RASWTVASLV PRLADERRRL DEMRVGNLAV
EATFALGYGQ LSPAQARAFR LLSLPGGPDI SAGAASALLA LSPMDTEDIL ESLVDASLLE
APAPGRYRFH DLLKLFARRT GERAEGGAGR GGEGAPALRR LLDFYLASAR SAHRLAYEGS
TVADQLAAAG PGHAFGSADE AVAWLSVEAE ALFASIAQAS GAEEAGALLP GADLLLAMEP
LLESGSHARE FDQRAKEVLA AARRLGGTSS ELRCRYVLGR VLFNANRLAE AESEFRASLD
LAAGGDRIVT GEAMNALAVV AGRQRRHVEA LAWFDSARRV FREVGARGGE ALTLSYSARD
HLFLGQQEEA IAAAEQGLAV FIELGFSAGT ARARYHLGMI LSRVGRLNEA VHHHAECLAF
FRASKQRVWE QRVCSRLAET FITAGRFSDA TRHAEQALTV SREIGHPYGE ALSLWVLGRA
LAGLGSTGRS HDCLERAHDI FVRLGAPEAA DLRALLDRDH VGN