Gene Information |
Locus tag | Sros_5737 |
Symbol | |
ID | 8669031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 6280346 |
End bp | 6281476 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | putative sorbitol dehydrogenase |
Protein accession | YP_003341228 |
Protein GI | 271967032 |
COG category | |
COG ID | |
TIGRFAM ID | |
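
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 6280346
end_bp = 6281476

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # compare with "Gene Length"
print(protein_length)  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the reported Gene Length of 1131 bp and Protein Length of 376 aa.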
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0763433 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0753834 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAG CGGCGCGAGC AGCGATCGTC GTGGACGCCG CGGGCGCGGT GGAGATCCGG TCCTACGCCG TGCCGGAGCC CGGTCCGGGC GAGTTCGTTC TCAAGCTGGA GCTGTGCGGC GTCTGCGGCA CCGACGCGCA CATCTCCCGG GGGCGCCTGG CCAGCGTCAC CTTCCCCGCC CTGCTGGGAC ACGAGATCGT CGGCACCCTG GCCGCCCTCG GCGAGGGGGT GACCGCCGAC CACGGCGGCC GCCCCGTGGC CGTGGGCGAC CGCGTCGGCG TCTTCCCCGC GCTGAGCTGT GGTCGCTGCT ACCAGTGCCA GGTACGGCGC CGCCCCGCCA ACTGCCCCGA CCGGCGGCCC TCCTACGGAT TCAAGTCTCC GGTGACGACC CCGCCGCACC TCACCGGCGG GTTCGCCGAA TACCTCCACG CCGCCAACGC CGGCACGGTG TTCTACCGGA CGGACCTGCC CCCCGAGGTC GCGGTCCTGC AGGAGCCGAT GTCGGTGGCG CTGCACGGGA TCGAGCGGGG CTCGGTCGGC GTCGGCTCCA CCGTGGTCGT CCAGGGCGTG GGCGCGATCG GGCTGATGGC CGTGGTCGCG GCCCGCGCGG CGGGGGCGCA CCGGGTCGTC GCGGTGGGGG CCCCCGCCGC CCGGCTGGAG CTGGCGGCCG CGCTCGGGGC GGACGCGACG GTGAGCATCC ACGACCTGAC CGGCGTGGCC GAGCGCCGCA GCGCGGTGTT CGACGCCCTC GGCTCGCCCG GGGCGGACTG CGTGATAGGC GCCAGCGGCT CGCCGGAGGC CTTCATGGAG GCCATCGGCC TGGTGGCCGA CGGCGGCGTG CTCGCCGAAC TGGGCAACTT CACCGACCGG GGGACGGTGC CCTTCAACCC CTTCAGCGAC CTGCTGAAGC GTGACATCAC CATCGCCGGC GTCTACGGCG CCGGCCCCGA CATGCTGCGC CGCTACCACC AGGCGCTGCT CATCCTGGAA CGCGGCGGCT GGCCGTACGA CCGTGTGGTC AGCCACCGGG TGCCGCTGGA GCGGGTGGGC GAGGCCCTCG CCGCCCTCGG TGGCGGCTCG CCCCTCGACG GCCGGGAGAT CGTCAAACTC GCGATCGACC CCACCGCCTG A
|
Protein sequence | MSTAARAAIV VDAAGAVEIR SYAVPEPGPG EFVLKLELCG VCGTDAHISR GRLASVTFPA LLGHEIVGTL AALGEGVTAD HGGRPVAVGD RVGVFPALSC GRCYQCQVRR RPANCPDRRP SYGFKSPVTT PPHLTGGFAE YLHAANAGTV FYRTDLPPEV AVLQEPMSVA LHGIERGSVG VGSTVVVQGV GAIGLMAVVA ARAAGAHRVV AVGAPAARLE LAAALGADAT VSIHDLTGVA ERRSAVFDAL GSPGADCVIG ASGSPEAFME AIGLVADGGV LAELGNFTDR GTVPFNPFSD LLKRDITIAG VYGAGPDMLR RYHQALLILE RGGWPYDRVV SHRVPLERVG EALAALGGGS PLDGREIVKL AIDPTA
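
The gene and protein sequences can be checked against each other and against the GC content field with a short script. This is a hedged sketch rather than the database's own method: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 is assumed to share (table 11 differs in its permitted start codons, which this sketch does not handle; the gene here starts with ATG). Only short excerpts of the gene sequence are used below; paste the full sequence to check the whole record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spacing between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the Gene sequence field.
first_30 = "ATGAGCACAG CGGCGCGAGC AGCGATCGTC"
last_21 = "GCGATCGACC CCACCGCCTG A"

print(translate(first_30))
print(translate(last_21))
print(gc_fraction(first_30))
```

The two translated excerpts correspond to the first ten residues (MSTAARAAIV) and the last six residues (AIDPTA) of the Protein sequence field. Running `gc_fraction` on the full gene sequence gives a value to compare with the reported GC content.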