Gene Information |
Locus tag | Sros_3951 |
Symbol | |
ID | 8667241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4397733 |
End bp | 4398833 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003339604 |
Protein GI | 271965408 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0100165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0185065 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCGGC TGGACGAGCT GCGCGGACCG GTACCGTCCC GCGCGCACGA CGCCCGCGCC ATCGCGGCGC TGGCCGCCAA CCCCGGCTGC GCCCGCCGGG CGCTGATGGA CGCCGCTGGG GTGGACAAGG ACCGTACGGC CCGGCACCTC GGCTTCCCGG CCCCCTTCGG CCAGTCGCAG TTCGCGATCA CGCGCGGCAA CGTGTTCGAG GCGCTGGTGA AGGAGAACGG CTGCGCCGAG CTGCTGCGGC TCCTGCGCGA GCTGCTCGGC CTGCCCGTGG CCCAGGTCGG CTACCAGGAC GTGGAGAGCG TCGGCTCCCA CCTGCGCCAC TCCCACACCC GGACGCTGAT CGACCGGGCG GCGCGCGAGA ACGACGACGC GGCGGTCTTC TACGACCACC CGCTGTTCAG CCTGGAGATC GCCGGGCACA CCTCCTACCT GGAGCCCGAC GTGGTGGCCT TCCAGCTCGG GGGGCGCTTC CGCATCGTGG AGATCAAGTC GTTCGGCGTG ATCGACGGCC AGGCCGAGCC CGAGAAGGTC GCCGCCGCGG CCAGGCAGGC CGCAGTCTAC GTCCTGGCGC TGCGCACGCT CCTGGCCGAC CTCGGGCACG ACCCCGAGCG CGTCTCCCAC GACGTGGTGC TCGTCTGCCC GGAGAACTTC GCCAACCGGC CGACCGCGAC GCTGGTGGAC GTGCGCAAGC AGCTCGCCGT GCTCAAGCGG CAGCTCGCCC GGATGACCCG GGTGGACCGC CTGCTCGAAG GGCTCCCCCA GGGGCTCACC TTCGACCTGG CCCCCGACGC GGACGGCGTG CCCACCCGGT CGGCGGAGGA GCTGGCCGGC GCGCTGTGCC AGGTGCCCGC CCGCTACGCG CCCGACTGCC TGTCCACCTG CGACATGTGC ATGTTCTGCC GTGACGAGGC CCGTGGCTGC GGCTCCACCG ACCTGCTGGG CCGCCAGGTC CGCGACCAGC TCGGCGGCGT CTCCCTGATG ACCGAGGCCC TCGGCCTGGC CGAGGGCACC GTCGAACCCG CCGAGGGCCA GGAGGAGGTC GCCCGCCTGC TCCGCCTGGC GGACCGCCTG CGCGAGGAGT GCCTGAATTG A
|
Protein sequence | MNRLDELRGP VPSRAHDARA IAALAANPGC ARRALMDAAG VDKDRTARHL GFPAPFGQSQ FAITRGNVFE ALVKENGCAE LLRLLRELLG LPVAQVGYQD VESVGSHLRH SHTRTLIDRA ARENDDAAVF YDHPLFSLEI AGHTSYLEPD VVAFQLGGRF RIVEIKSFGV IDGQAEPEKV AAAARQAAVY VLALRTLLAD LGHDPERVSH DVVLVCPENF ANRPTATLVD VRKQLAVLKR QLARMTRVDR LLEGLPQGLT FDLAPDADGV PTRSAEELAG ALCQVPARYA PDCLSTCDMC MFCRDEARGC GSTDLLGRQV RDQLGGVSLM TEALGLAEGT VEPAEGQEEV ARLLRLADRL REECLN
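The fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates (which agrees with the stated gene length of 1101 bp). It also translates the first 30 bases of the gene sequence using the standard codon assignments of translation table 11, treating the GTG initiation codon as Met.

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order for each of the three codon positions.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    return length_bp // 3 - 1


def translate_bacterial(dna: str) -> str:
    """Translate a CDS given 5'->3' on its coding strand.

    The first codon is reported as M, since table 11 initiation codons
    such as GTG are read as Met at the start position. Translation stops
    at the first stop codon.
    """
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


if __name__ == "__main__":
    length = gene_length(4397733, 4398833)
    print(length, protein_length(length))
    # First three 10-base groups of the gene sequence in this record.
    print(translate_bacterial("GTGAACCGGC TGGACGAGCT GCGCGGACCG"))
```

Passing the full gene sequence to `translate_bacterial` would allow the same comparison against the complete protein sequence; only the first ten codons are shown here to keep the sketch short.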
|
| |