Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_5351 |
Symbol | |
ID | 8668645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 5863115 |
End bp | 5865703 |
Gene Length | 2589 bp |
Protein Length | 862 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003340857 |
Protein GI | 271966661 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.466461 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGG CGATGGGACA CCCTGGGACA GCTGAAACGG CCCTGACCAG TGGCGAGTGC CCTCAGGAAG TTTCCGCCGA GGTGGGACAG GGAGACAGGG GAACGGTGGC TGTGCAGCAG CGGACGGCGA AGGCCGACCC GAAGGACGAG CTGGCCGCCC GCCTGCGCCT GCTCCAGGAG CTGTCGGAGC GCGGAGTCCG GGCCCTCGCA CGGGACGCGG GCCTCAGCTC GTCGTCGCTT TCGCGCTACC TCAGCGGCCA GACGGTGCCG CCCTGGCCCG CGGTGGTCGC GCTCTGCCGT CTCCTCAAGC GCGACCCCCG CCCGCTGCGC CCCATGTGGG AACAGGCCGC GAACCCCCTG CCGGCGCCTC CGAAGACCAG CCGCCAGGTC CAGCCCCCGT CGCCACCCGC AGGGGCTCCG CGCCCGCCCC GCAACGATCT GCCGCGCGAC GTGCCCGACT TCACCGGACG TGAGGCCCAG CTCGCCGCCG TGCTCGCCGC GGTGGACAGC AGCCGGGTGG TCGCCGTCGA CGGCATGGCG GGTGTCGGCA AGACCTGTCT GGCGCTGCAC GCCGCGCACC GCCTCGCCGC CGACTATCCC GACGCCCAGC TCTATGTGGA CCTGCACGGG TTCACCGACG GGCGGGAACC GCTCGGTCCC GAACCCGCGC TGCGGGCCTT GCTCGCCGCC CTCGACGTGC CATCGGAAAA GATCCCGCAG GAGGGTGGCA TCGAGCCGCT GGCGGCCTGC TGGCGGTCGG AACTTGCCGG CCGGCGGGCC GTCGTGGTCC TCGACAACGC CGCCGGCGCC GACCAGGTCC GCCCGCTGCT GCCCGGCGCC GGCCACTCCG TCGCCCTGAT CACCAGCCGC AACCGGTTGC TGGGCCTGGA CGAGGTGCCC CCGGTGTCGC TGGACGTGCT GACCCCGGAG GAGAGTGCGG AGCTGCTGGC CCGCGCCAGC GGCGATCCCG GCGGCTCCGA CGGCCGGCTG GCCCGCGACC CGGAGTCCGC GGCCGAGGTG CTGCGGTTGT GCGGCCACCT GCCGCTGGCC CTGCGGCTGG CCGGGGCCCG GTTGCGCCAC CGGCCGGGCT GGACTGTAGG CATCCTCGTC GAGCGTATGG CGGAGGGCGC GGGCGAGTTC GACACCGCGC TCGCGATGTC CGTACGGCAG CTGGACCGGG CCGAGCGCCG GTTGTTCCGG TTGCTGGGCC TGCTTCCCGG CTCGACCTTC GACGAGTACG TGGCGGCGGC ACTGGCGGAC ATACCTCTGC GCAGCGCCCG GGCGATGCTG GAGGACCTGC TCGACGCGCA CCTCGTCCAG CAGGCGGCGG CCGGCCGCTA CCGGTTGCAC GATCTGGTGC ACCAGCACGC GCGCCGGTCC GCTGCCGAGC AGGACTCACC GGCGGAGCTG GAGCAGGCGC TGGGCCGGGT GCTCGACTAC TACGTGCACG CGGCGGCCAC GGCCGACGCC GCGATGTCGT ACTTGTCTCC CAGCCGGGCG GTCTCCGCGG GCCGCCCCCC GGCCGAGCTG CCGCGGTTCG CGGGCAAGCA CGCGGCCCTC TACTGGTTCG TCACTGAGTA CACCAACCTG ATGGCCGTCT TCGAGGCGGC GGTCGCGGCC GGAGCCGACG TGCACGTGTG CGAGCTGCCG CGCTTCATGC GGGCGTTCTT CGCGCGGCGC TGTGGCACGA CGCATCTCAA CGTCCTGTTC GAGCGGTCGC TGGTCGCCGC GCAGCGCCTG GACGATCCGC TGCAGCTGGC CGAGGCGCAC AGCGACCTGG GCTTCGCCCG GTACAACGCG GGCCGGATGG CGGAGGCGAG TGCCGCGTAC GAGGCGGCGG CACCGCTGCT GTCCCAGGCC GTGGACACCA GGTCCGAGGC CGAACTGACC ATGCGCCGCG GCTACCTGAG GTGGGACGAG GGCCATGTCG AGGAGCCACT CGGCCTTTTC CGGCTGGCGG GCAAACTGTA CGCGGACGCC GGCTGCCCGA TGGGCACGGC CCATGCGATC GCGTACGAGG CCTGGGCGAT GCTGCAGCTG GGACACCGTG AGGAGGCGGC GCGGCTGGCC CGCGAGGCGC TGGACATCCC GCACACCGAC CCCGCATGGC CCCCCACGCT GACGGCCCGG ATCACCCTCG GCGTGGCCAT CGCCCCCGAG GAACCCGACG AGGCGATGGA GCACCTGGAG CAGGCGCTGG CGCTGGCCCG CGAGGACGGG CACAAGCACA ACGAGGCGTG GTGCCTCAAC TGCCTGGGCG TCGCGCTGCG GCAGACAGGC CGGTACGAAG AGGCGCTCGC CAGCCACCGC CAGGCGTTCG CCCTGCTGGA CGAGCTGTTC GAGGAACATT GGAAGATCCA TTTCCTCAAC GGCTACGCCG AGACCTGCCG GCTGGCGGGC CTGCCCGAGG AGGCCCTGCG GCTGCACCGG CAAGCGCTGG AGCTGGCCCC GAGACTCGGG TACCGGTACG CGGAGGCGCT GGCCCACGAG GGAATCGCCG GCGTCCTCGA CGAGACGGAC CCGCCCGCAG CCGCCGAACA CCGCGCGGCC GGGCAGGCCA TCGTGCAGGA GCTCAAACCA GGCGCCTGA
|
Protein sequence | MTTAMGHPGT AETALTSGEC PQEVSAEVGQ GDRGTVAVQQ RTAKADPKDE LAARLRLLQE LSERGVRALA RDAGLSSSSL SRYLSGQTVP PWPAVVALCR LLKRDPRPLR PMWEQAANPL PAPPKTSRQV QPPSPPAGAP RPPRNDLPRD VPDFTGREAQ LAAVLAAVDS SRVVAVDGMA GVGKTCLALH AAHRLAADYP DAQLYVDLHG FTDGREPLGP EPALRALLAA LDVPSEKIPQ EGGIEPLAAC WRSELAGRRA VVVLDNAAGA DQVRPLLPGA GHSVALITSR NRLLGLDEVP PVSLDVLTPE ESAELLARAS GDPGGSDGRL ARDPESAAEV LRLCGHLPLA LRLAGARLRH RPGWTVGILV ERMAEGAGEF DTALAMSVRQ LDRAERRLFR LLGLLPGSTF DEYVAAALAD IPLRSARAML EDLLDAHLVQ QAAAGRYRLH DLVHQHARRS AAEQDSPAEL EQALGRVLDY YVHAAATADA AMSYLSPSRA VSAGRPPAEL PRFAGKHAAL YWFVTEYTNL MAVFEAAVAA GADVHVCELP RFMRAFFARR CGTTHLNVLF ERSLVAAQRL DDPLQLAEAH SDLGFARYNA GRMAEASAAY EAAAPLLSQA VDTRSEAELT MRRGYLRWDE GHVEEPLGLF RLAGKLYADA GCPMGTAHAI AYEAWAMLQL GHREEAARLA REALDIPHTD PAWPPTLTAR ITLGVAIAPE EPDEAMEHLE QALALAREDG HKHNEAWCLN CLGVALRQTG RYEEALASHR QAFALLDELF EEHWKIHFLN GYAETCRLAG LPEEALRLHR QALELAPRLG YRYAEALAHE GIAGVLDETD PPAAAEHRAA GQAIVQELKP GA
|
| |