Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4762 |
Symbol | |
ID | 8668056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 5286943 |
End bp | 5288004 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | AraC-type DNA-binding domain-containing protein- like protein |
Protein accession | YP_003340338 |
Protein GI | 271966142 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.505446 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGTCCG AGACGGTGTT TCGGAGCGAT GACGTGCCTC CGGCGGATCG ATTCGAGCGC TGGCGTGAGC TTGTCAACCA GGCGCACGCG CCCATGGACA TGACCAGCGA CCACCGGGAG GATTTCCGGG CCTCTCAGCG TCTGCTGGAT CTCGGCTCCG TTTCCGTATG GCGGACGGCC TTCCAGTCCG TATGTTGTCG CCGGACCCCG AAGCTGATCC GGCAGTCCGA CCCCGAAGGG GTGCACCTGT CGCTGCCCAC GAACGGCCCT CTCGTAACCG TTCGCGGCGA TCACGAGATC GTATACGACC CATATAGCCT ATGTGTTTAC GACACCTCGC GGCCCACCGA GTTACACGCG GGCGACTCAT CAAACCTGCA TGCGGGAGTG GCGCTTGAGA TCCCCAAGGC GCTACTGCAC CTGCCCGGGA ACATGCTTGA AAAGCTGACT ACGCGCCGGC TGTCGGTGCG GGAGGGCTTC GGCGCCCTAC TGGCCCATTT CCTTACACAT TTGATGAAAG GAACCGGCTC ATACCAGCCG TCCGACGGGT TTCGGCTGGG GACGGTCGCG GTCGACCTCG TGTCCGCGCT GTTCGCCCAC ACCCTGGACG CCGACGACAT CCTCCCTCCG GAAACCCGCA GACAGACCCT GATCCTGCGC ATTCACGCCT TCATCGAGCT CAACCTCGTC GACCGCGCTC TGACTCCGGC CGATATCGCG GCCCATCACC ACATATCCGT CCGCTACCTG CAGCTTCTCT TCCAGCAGCA GGGCAAGACC GTGACGGGCT GGATCCGCCG GCGACGCCTC GAACGGTGCC GCGAGGACCT CGCCGACCCC TCCCAGCTCG CTCGCTCGAT CCGTGCCGTC GCCCTGCGGA GGGGCTTCGC CACCTCCGCC GACTTCAGCC GCGCGTTCCG TGCCGCGTAC GGCATGTCCC CTTCGGAATA CCGTCATGCA GCACGCGCCG GTGATCTAGA GGCGATTCTG CCGGCAAACG CCTTGGGCAT CGCGCCGCCG GCGGTCCAGA AAAATGGGGG AGACATCTCT TCGGGTGCGT GA
|
Protein sequence | MLSETVFRSD DVPPADRFER WRELVNQAHA PMDMTSDHRE DFRASQRLLD LGSVSVWRTA FQSVCCRRTP KLIRQSDPEG VHLSLPTNGP LVTVRGDHEI VYDPYSLCVY DTSRPTELHA GDSSNLHAGV ALEIPKALLH LPGNMLEKLT TRRLSVREGF GALLAHFLTH LMKGTGSYQP SDGFRLGTVA VDLVSALFAH TLDADDILPP ETRRQTLILR IHAFIELNLV DRALTPADIA AHHHISVRYL QLLFQQQGKT VTGWIRRRRL ERCREDLADP SQLARSIRAV ALRRGFATSA DFSRAFRAAY GMSPSEYRHA ARAGDLEAIL PANALGIAPP AVQKNGGDIS SGA
|
| |