Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4397 |
Symbol | |
ID | 8667691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4906604 |
End bp | 4907890 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | putative sigma-70 factor, ECF subfamily |
Protein accession | YP_003340016 |
Protein GI | 271965820 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0260182 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0263489 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGAGC AGCCGACCGA TACCGGTACG GACAGGGCCG TGGAGTCGGT GTTCCGGGAG GAACACGGTC GGCTGCTCGC CTCACTCGTC GGCCGTTTCG GGGACCTCGA CCTGGCGGAG GAGGTCGCCT CCGAGGCGAT CGAGGCCGCG CTGATGCACT GGCCGGTGCA GGGCGTTCCG GCCAAGCCGG GTGCCTGGCT GCTGACGACG GCCCGGCGCA AGGCCGTCGA CCGGCTGCGG CGGGACCAGG CCTACGCCGC CCGGCTCGCC GCCCTGCAGG TGGAGGCGGA CCGGGCCGCC TCCGCCCCGC CCGCGGACGC GGACGCCGAT CTCCCGGACG AGCGGCTGCA GCTGTTCTTC ACCTGCGCCC ACCCGGCCCT GCCGGCCGAG GCTCGCGGGG CGCTGACGCT GCGCTGCCTG GCCGGACTGA CCACACCCGA GGTCGCGCGG GCCTATCTCG TCCCGCCGGC GGCGATGGCC CAGCGGATCG TGCGGGCGAA GAAGAAGATC CGCGAGGCCC GGATCCCCTT CAGGGTGCCG GGCGCCGACG AGTTGCCCGC ACGCCTGCCG GGTGTGCTCC AGGTCCTCTA CTCGATCTTC ACGGAGGGGT ACGCGGCCAG CGCCGGAGCG CAGCTGCAGC GGCTCGACCT CGCCGAGGAG GCCCTTCGGC TGGCACGGAT CCTGCGCCGG TTGCTGCCCG CCGAGCGGGA GGTCGCCGGC CTGCTCGGGC TCATGCTGCT GGTCCACGCG CGGCGCGATG CCCGGACCGG CCCGGACGGC GAGCTCGTGC TGCTGGAGGA CCAGGACCGC GGCCGCTGGG ACCGTACGAT GATCGAGGAG GGCCTCGCCC TGGTGCCCGC CGCGCTGACC GGCGGCCCGC CTGGACCGTA CGGCGTGCAG GCCGCGATCG CCGCCCTGCA CGACGAGGCG GCAGACCTCG CGACCACCGA CTGGCCGCAG ATCGTGGCGC TCTACGGCGT GCTGCTCGCC CTCGCCCCCT CTCCCGTCGT CGCCCTGAAC CGGGCCGCGG CGGTGGCGAT GTGCGACGGC CCGGAGGCCG GCCTGGCGCT GCTCGACAGC CTGGCCGGCG AGGAGAGGCT GCGCGGCCAC CACCCCTACC CGGCGGCCCG GGCGGACCTG CTGCAACGGC TCGGCCGGCT CCCCGAGGCC GCCGCTGCCT ACCGGGAAGC GCTCGCCCTG GCCGGCACCG AACCCGAACG CGCTCACCTG CGACGCAGGC TGGAGGCGGT CGAGCCATCC GGCCCGGACG CCGGGGCCGG CACGTGA
|
Protein sequence | MAEQPTDTGT DRAVESVFRE EHGRLLASLV GRFGDLDLAE EVASEAIEAA LMHWPVQGVP AKPGAWLLTT ARRKAVDRLR RDQAYAARLA ALQVEADRAA SAPPADADAD LPDERLQLFF TCAHPALPAE ARGALTLRCL AGLTTPEVAR AYLVPPAAMA QRIVRAKKKI REARIPFRVP GADELPARLP GVLQVLYSIF TEGYAASAGA QLQRLDLAEE ALRLARILRR LLPAEREVAG LLGLMLLVHA RRDARTGPDG ELVLLEDQDR GRWDRTMIEE GLALVPAALT GGPPGPYGVQ AAIAALHDEA ADLATTDWPQ IVALYGVLLA LAPSPVVALN RAAAVAMCDG PEAGLALLDS LAGEERLRGH HPYPAARADL LQRLGRLPEA AAAYREALAL AGTEPERAHL RRRLEAVEPS GPDAGAGT
|
| |