Gene Information |
Locus tag | Sros_5233 |
Symbol | |
ID | 8668527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 5749155 |
End bp | 5750204 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | AraC family transcriptional regulator |
Protein accession | YP_003340745 |
Protein GI | 271966549 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
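The coordinate and length fields above are mutually consistent, and a short sketch can check this. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp / End bp rows of this record.
length = gene_length(5749155, 5750204)
print(length, protein_length(length))  # 1050 349
```

Under these assumptions the computed values match the Gene Length (1050 bp) and Protein Length (349 aa) rows.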
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.896357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.182047 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCAAG ATTCCTCGCA CGACGGTCGC GCGGCGGACC TGCACCGGGT CGTCGTGATC GTGGACGAGA ACTCCAACCC CTTCGAGCTC GGCTGTATGA CCGAGGTCTT CGGCCTGCGC AGGCCCGAGC TTGGCCGCGA TCTCTACGAC TTCCGCCTCT GCTCGCCCGA ACCCCGCACC CTCATGCGAG ACGCCTTCTT CACCCTGACC GGAGTCGCCG GACTGGAGGC GGCCGAGTCG GCGGACACGT TGATCGTCCC CAACCGCCCC GACGTCGAGG TGCCGCGCCG CCCGATCGTG CTGGACGCCG TACGGCGGGC GCACAGGCGC GGTACGCGTC TTGTCGGCTT GTGCAGCGGA GCTTTCACCC TCGCCGAGGC CGGGGTCCTC GACGGGCGCC GGGCCACCGC CCACTGGCAG TGGGTGGACG CCTTTCGTGC CCGCTTCCCC TCCGTGCGGC TCGAGGAAGA CGTGCTGTTC GTCGACGACG GCGACATCCT CACCGCCGCC GGTAGCGCGG CCGCTCTCGA CCTCGGGCTG CATATCGTCC GCCGCGACCA CGGCGCCGAG GTCGCCAATT CCGTCAGCCG GCGGCTGGTC TTCACCGCGC ACCGGGACGG CGGGCAGCGG CAGTTCATCG AGCGCCCGGT GCCCGACCTG CCGGACGAGT CCCTGGCGCC GATCCTGGCA TGGGCGCAGG AGCGGCTGGA CTCACCGCTC ACGGTCTCCG ACCTCGCGGC GCGCGCCTCG GTCAGCCACG CGACACTGCA CCGGCGCTTC CGGGCACAGC TGAGCACGAC GCCGCTGGCA TGGCTCACGG GGGAACGGGT CGCCCTGGCC TGCCGGCTGA TCGAGCGGGG TGAGACACAC CTGGAGGTGG TGGCACGGCA CAGCGGGCTG GGCACCGCCG CCAACCTGCG CGCGCTGATG CGCCGCCAGA CAGGGCTCAC CCCGTCGGCG TACCGGCAGC GGTTCGGAGC TGGAGCGGCC CGTCAGCGAC CGCTTCCGCC CCGCCCGGCT CGCCCTGCCG CACCACTTCC GGCAAGCTAG
|
Protein sequence | MSQDSSHDGR AADLHRVVVI VDENSNPFEL GCMTEVFGLR RPELGRDLYD FRLCSPEPRT LMRDAFFTLT GVAGLEAAES ADTLIVPNRP DVEVPRRPIV LDAVRRAHRR GTRLVGLCSG AFTLAEAGVL DGRRATAHWQ WVDAFRARFP SVRLEEDVLF VDDGDILTAA GSAAALDLGL HIVRRDHGAE VANSVSRRLV FTAHRDGGQR QFIERPVPDL PDESLAPILA WAQERLDSPL TVSDLAARAS VSHATLHRRF RAQLSTTPLA WLTGERVALA CRLIERGETH LEVVARHSGL GTAANLRALM RRQTGLTPSA YRQRFGAGAA RQRPLPPRPA RPAAPLPAS
|
| |
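The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of removing that grouping, computing a GC fraction, and translating the opening codons. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), so the sketch always reads the first codon from the table and does not model alternative starts. All function names are hypothetical helpers, not part of any library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def ungroup(seq: str) -> str:
    """Remove the display whitespace between ten-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the ungrouped sequence."""
    dna = ungroup(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = ungroup(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the Gene sequence row of this record.
prefix = "ATGTCGCAAG ATTCCTCGCA CGACGGTCGC"
print(translate(prefix))  # MSQDSSHDGR
```

The translated prefix matches the first block of the Protein sequence row. Passing the full Gene sequence value to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein sequence fields.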