Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_1543 |
Symbol | |
ID | 8664819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 1633720 |
End bp | 1635438 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003337279 |
Protein GI | 271963083 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.100753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.657816 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCACC CAGACGTCCC TCGATCGCAG GGCAGGCACG CCGGCGGGCC CGCCTCCACC GGGGCACCGA GCCCGGCGGT GCTGGAACAG GCCTCGCTGG AGCAGGTGAT CGGGCGGCTC GGGCCCATGG CGCCGCAGCA GGCCGCCACC GTCGGCCTCG CCGTGCTGGA CCAGCTCGTG GTCGTGCACG GCCAGGGGAT GCTCCACGGT GACGTACGGC CCGGCTCGGT GCTGCTCGGT CCCTACGACC AGATCATCCT CAGCGCGCCG ACCTTCCGGT CCCCGACCTT CACCGCCCCC GAGGGCGTGA CGGGCCCGGC GGCCGACCTG TGGTCGCTCG GTGCCACCCT CTACACCGCC GTCGAGGGGC GGGCGCCGTC ACCCGGGGGA TCCCTCGAGA ACGCGGGCCC GATCGCGCCG GTCCTGTTCC AGCTGCTCTC CGGCGACCCC GCCCGGCGGC CCGACCCCGG CACCCTGCGC AACATCCTGC TCGGCATCTC CCAGAGCCGC GGCGAGGCCC CGGCCCCGCT GCCCCCCGCC CCCGCGGACC TGCTCTCTCC TCCGGATCCG CTGCCCTCCG CGGACCCGCT GTCGGGGCCG CCTCCCGCGG ACGCGCTGTC GGGTCCCCCG GACGCGAGAT CCGCCGCCCC CTCCTCCCAC GCGCCGTCCC CCTCGGACGC GCTGTCCGGA GTCCCGCTCC CCTCGGCGCC CTTCGAGACG CAGTCCACCG TCCCGGTGCT GACCGCGACG GCGCAGCCCT CCACCCCGGA GGCGGCCTCA CCGCCCCAGC CCGTCTTCGA CTCCGCGGAC ACCATGCCGC CCCGGGCCGC CTCCGATCCG GCGGCCATCG CGCCGATCCC CGCGGATCCG CGCGGGCCGG GGGTTCCCCC GCAGGCCCCG CCCCCTTCCG GCGCCCCCGC CTCGCAGGAA CTCGTCCCCG CCACCGGCGG GCCACGCGAG GTTCTCCCCG CCTCTCCGGC GCAGGGCGAA TCGACGGGCC CGACCAGTCC CGCCGGTCCG GCCGGACCGG CCGGACCGGC CGGACGCTCC GACCGGCGGG CCGGGGTGCT GGTGCCCCGC CCGGTCGTGG CGCTGACCGG TGTCCTGGTC CTCGGCATGG CGGTCGCCAT CGGCGTCCTG CTCGCCTCGC CGGGCGACGG CTCCGGCGAG GGCGACGCCA CCGCCGCACC CGCCGCCGGC GCCAAGGGCC TGTTCGCCAC CGCGCCTCGC GCCTGCAGCC TGCTCGACGA CAAGCAGGTG AACGAGCTCG TGCCGGGCTT CAGGAGCTCG GAGGTCGAGC CCGCCGCGTG CGACTGGCTC AATCAGCATG ACTGGCGCAA GCCCAGCCCG GAGAAGTTCG ACCTCCGCGT ACGGCTGGTC GCCCAGAAGC CGGACGCCTC CGGGGTCGAG CGGGCGAAGG AGTATCTGTC CGGCAAGAGG ACGGACCTCG TGGCGAGCGG CAAGTTCGCG ACCCCGAAGC CCGCGCCGCC CCAGAGCCTG AAGGGGATAG GCGAGGAGGC CTTCACCACG GGCGGCTACA ACTCGATCAA CCTCTACGGC GGCTCCTACA AGGCGACCGT GCTCTTCCGG GTCGGCAACC TGATCGCCCA GGTCGAGTAC GAACGGGGCG GCGTCAAGGA GGACCGCGAC GGCGAGATCG CGGCGGGCGC CCAGAAGGCC GCCCGCTGGC TCACCCAGTC GTTGAAGACC GATGGCTGA
|
Protein sequence | MNHPDVPRSQ GRHAGGPAST GAPSPAVLEQ ASLEQVIGRL GPMAPQQAAT VGLAVLDQLV VVHGQGMLHG DVRPGSVLLG PYDQIILSAP TFRSPTFTAP EGVTGPAADL WSLGATLYTA VEGRAPSPGG SLENAGPIAP VLFQLLSGDP ARRPDPGTLR NILLGISQSR GEAPAPLPPA PADLLSPPDP LPSADPLSGP PPADALSGPP DARSAAPSSH APSPSDALSG VPLPSAPFET QSTVPVLTAT AQPSTPEAAS PPQPVFDSAD TMPPRAASDP AAIAPIPADP RGPGVPPQAP PPSGAPASQE LVPATGGPRE VLPASPAQGE STGPTSPAGP AGPAGPAGRS DRRAGVLVPR PVVALTGVLV LGMAVAIGVL LASPGDGSGE GDATAAPAAG AKGLFATAPR ACSLLDDKQV NELVPGFRSS EVEPAACDWL NQHDWRKPSP EKFDLRVRLV AQKPDASGVE RAKEYLSGKR TDLVASGKFA TPKPAPPQSL KGIGEEAFTT GGYNSINLYG GSYKATVLFR VGNLIAQVEY ERGGVKEDRD GEIAAGAQKA ARWLTQSLKT DG
|
| |