Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_8874 |
Symbol | |
ID | 8672212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 9797333 |
End bp | 9798589 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003344250 |
Protein GI | 271970054 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00933707 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCTGGT TCGAGATCCT CCGATTCGCC CTCCGCGGGC TCGCGGCCAA CAAGCTGCGC AGCTTCCTGA CCACGCTGGG CATCCTCATC GGGGTGGGCG CGGTGATCCT GCTGGTCGCC TTCGGCGAGG GCGCCTCGCA GAGCATCCAG CAGAACATCC AGCGGCTCGG TTCCAACACC CTCACCATCT CCGCCTCGTT CTCCGGAGGC GGCTTCGGCG GTGGAGGCGG GGGCGGCGGC GGAGGTGGCC AGGCGGGCGG GCCGCGCACG CAGGCCAGGC AGCTCACGCT GGAGGACGCC AGGGCGCTGG CCGACCGGGA GCAGGCGCCG TCGGTCAGGA GCGTGTCCCC GGTGGTGACC GCGTCCTCGG CCACCGCCGT GCACGAGGGG GCGAGCCACA CGATCTCCCA GCTCGTCGGC ACCTACCCCA GCTATTTCGA GGCGACGAAC AAGCCCGTCA CGAGCGGCGC CTACTTCGTC AACGACGACG TGCTCGCCGC CCGGAAGGTC ATGGTGATCG GACGGACCGT GGCCGAGGAG CTGTTCGGCA CGGCGGATCC CGTCGGCCGG CAGGTCAGCG TCTCCGGAGT GCCGTTCACG GTGGTCGGGG TGCTCAAGGA GCTGGGCTCC TCGGGGATGA AGGACGCCGA CGACGTCGCG ATCGTGCCGC TGCCCGCCGT ACAGCAGAGC CTGACCGGGT TCGGTGCGCT CGGCTCGATC ATCGTGCAGG CGACCGGCGC CGACACGACG GGGTCGGCCC AGGCCGAGGT GACGGCCGTC CTGAACCAGC GGCACGACAT CACCCCCACG GGCACCGCCG ACTACCGCAT CCTGAACCAG GCCACCCTCC AGGAGACCGT CAGCTCGACC ATCGGCGTCT TCACCGCCCT GCTCGGCGCG GTCGCCGCGA TCTCGCTGCT GGTCGGCGGG ATCGGCATCA CCAACATCAT GCTGGTCACC GTCACCGAAC GGACCAGGGA GATCGGCATC AGGAAGGCCA TCGGGGCGCC CAGGAGCGCC ATCCTCGGCC AGTTCCTGCT GGAGGCGACG GTGCTGAGCC TGGTGGGCGG CCTGTCGGGT GTGGCGATCG CGTTCATCGG CACCCGGTTC ACGATCGCGG GCATCGAGCC CGTGATCGTG CCGTCCTCGA TCGCGCTGGC CCTCGGCGTC TCGGTGGGCA TCGGGCTGTT CTTCGGCAGC TACCCGGCCA ACCGGGCCGC CAAGCTCCGC CCCATCCAGG CCCTGCGCCA CGAGTGA
|
Protein sequence | MSWFEILRFA LRGLAANKLR SFLTTLGILI GVGAVILLVA FGEGASQSIQ QNIQRLGSNT LTISASFSGG GFGGGGGGGG GGGQAGGPRT QARQLTLEDA RALADREQAP SVRSVSPVVT ASSATAVHEG ASHTISQLVG TYPSYFEATN KPVTSGAYFV NDDVLAARKV MVIGRTVAEE LFGTADPVGR QVSVSGVPFT VVGVLKELGS SGMKDADDVA IVPLPAVQQS LTGFGALGSI IVQATGADTT GSAQAEVTAV LNQRHDITPT GTADYRILNQ ATLQETVSST IGVFTALLGA VAAISLLVGG IGITNIMLVT VTERTREIGI RKAIGAPRSA ILGQFLLEAT VLSLVGGLSG VAIAFIGTRF TIAGIEPVIV PSSIALALGV SVGIGLFFGS YPANRAAKLR PIQALRHE
|
| |