Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_3842 |
Symbol | |
ID | 8667132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4280599 |
End bp | 4281888 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003339504 |
Protein GI | 271965308 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0316121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0422385 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCAC GTTCACCCCG AGGCCGGAAG GCCGGGATCG CCCTGCTGGG CCTGTTGACC GCCGCGTCGG CGGCCGCCTG CGGCGCAGGC GGCTCGGGCG CCACGGAGGC CGCGAAGTCC ACCGGAGGCC TCACGATCAC CATGTGGACG CGTGCCGCGA CCCAGGCGCA GAGCGAGCGG CTGGTCAAGG CCTACAACAG CAGCCACAAG AACCAGGTGA AGCTGACCGT CATCCCGACG GACAACTACC AGCCGCGCAT CGCGGCGGCC GCGGGCGCCA AGCAGCTGCC GGACATCTTC GCCGCCGATG TGATCTTCGT GCCCAACTAC ACCTCGCAGG GCCTGTTCCT GGACATCACC GACCGGATCG GCGCCCTGCC GTACAAGGAC AGCCTCGCGC CCTCGCACAT CAAGCTGGGC ACGCTGGACG GCCGGTCCTA CACGCTCCCG CACACCATCG ACCTGTCGGT CTGGTTCTGG AATAAGGACC TGTACGAGAA GGCCGGGCTG GACCCGGAGA AGGGCCCCGG GACGCTCAAG GAGTTCGCCG AGCAGGCCGA AACGGTCGAC GAGAAGCTCG GCAAGGACGG CAAGGTGCAC GGCACGTTCT TCGGCGGCAA CTGCGGCGGC TGCTACGTGT TCACCTTCTG GCCCTCGGTC TGGGCCGCGA ACGGGCAGGT CATGAACCCC GAGGGCACGG CCTCGCTCAA CGACCGGGCC CCGATGACCG AGGTGTTCGC GATCTACCGC GACCTGTACG CCAAGGGCGT CACCGGCCCC ACCGCCAAGG AGGAGCAGGG CCCGACCTGG ACCGGTTTCT TCCCCAAGGG CGAGATCGGC GTCATGCCGA TGCCCTCGAC CACGCTCGGC TCGATGCCCG AGGACATGAA GATCGGTGTC GCGCCGATCG CCGGGCCCGA CGGCGGCGAG TCGACGTTCG TGGGCGGCGA CTCGGTCGGC ATCTCCTCCA CCACCGAGAA CGCCGACGCG GCCTGGGAGT TCCTGTCCTG GACGGTCTCC GACGAGGCGC AGGTCGAGGT CATGGCCAAG AACAAGGACG TGCCCGCCCG CACCGACCTG TCCGGCAACA AGTACTCCGC CGAGGACCCG CGCGTGGTGA TGATCAACTC GCTGGTGGCC AAGGGGCAGA CGCCGTACGC GCTGCGGTTC GGGCAGACCT TCAACGACCC GCAGGGGCCG TGGCTGCGCC TGGCCCGCGA GGCGGTGTTC GGCGACGCGG GCAAGGTCGC AGGGCTCAAC GCCGACATCA CCAAGTCGCT GCAGCAATGA
|
Protein sequence | MNPRSPRGRK AGIALLGLLT AASAAACGAG GSGATEAAKS TGGLTITMWT RAATQAQSER LVKAYNSSHK NQVKLTVIPT DNYQPRIAAA AGAKQLPDIF AADVIFVPNY TSQGLFLDIT DRIGALPYKD SLAPSHIKLG TLDGRSYTLP HTIDLSVWFW NKDLYEKAGL DPEKGPGTLK EFAEQAETVD EKLGKDGKVH GTFFGGNCGG CYVFTFWPSV WAANGQVMNP EGTASLNDRA PMTEVFAIYR DLYAKGVTGP TAKEEQGPTW TGFFPKGEIG VMPMPSTTLG SMPEDMKIGV APIAGPDGGE STFVGGDSVG ISSTTENADA AWEFLSWTVS DEAQVEVMAK NKDVPARTDL SGNKYSAEDP RVVMINSLVA KGQTPYALRF GQTFNDPQGP WLRLAREAVF GDAGKVAGLN ADITKSLQQ
|
| |