Gene Information |
Locus tag | Sros_2035 |
Symbol | |
ID | 8665317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 2183616 |
End bp | 2185595 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | transposase Tn3 family protein |
Protein accession | YP_003337763 |
Protein GI | 271963567 |
COG category | |
COG ID | |
TIGRFAM ID | |
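
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions, not statements taken from the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2183616, 2185595)
print(length_bp)                  # 1980, matching "Gene Length"
print(protein_length(length_bp))  # 659, matching "Protein Length"
```

Under these assumptions the reported gene length (1980 bp) and protein length (659 aa) agree with the listed coordinates.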
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.200024 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.850327 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTCGTG AGTGGGAGCT GGACGAGCTC ATCGAGTGCT GGACGCTGGA TGAGGACGAA CTGGGGCTGC TGAGCAACAA GTCCGGGGCG ACGCGGCTGG GGTTTGCGCT GCTGTTGAAG TTCTTCGAGC TGGAGGGGCG GTTCCCTCGG CGCGAGGATG TGCCCAAGGC CGCGGTGGAG TACATGGCCG GGCAGGTGGG GGTGGAGGCG GCGCTGTTGT TCGCCGACTA CCAGTGGAGC GGCAGGACGA TCGAGTACCA CCGCAGCCAG GTCCGCAAGC ACCATGGGTT CCGGCAGCCG ACAGTGGCCG ATGAGGACAA GCAGATCTTC TGGCTGGCGG GTGAGCTGTG TCCGGAGGAG TTGTCGCGGG ATCGGCTGCG CGCGGCGCTG CTGACGCGGT TCCGGGCGGA GAAGATCGAG CCGCCGACTC CGGTCCAGGT GGACCGGCTG CTGGGCGCGG CCGTGGCGAT GTTCGAGCGG GACTTCACCA CCCGCACCGC TCAGCGGCTG GGATCGGCGG CCGGCGCGCG GCTGGACGAT TTGATCGCGG CTCCCGAGGT CGAGGCCGAC CCTGTCGAGG TGCCGGTGGC CGCGCCGGGA CGCAGCTTCT TGCAGGAGCT GAAGGAGGAT CCGGGGCCGA TCCAGTTGGA CACGCTGCTG GCCGAGATCG TCAAGCTGGA ACGGGTGCGC GCGATCGGCC TGGGCGAGGG CCTGTTCGAG GGGGTGTCGG AGAAGATCGT GGAGTCCTGG CGGGCGCGGG CGATGCGGAT GTACCCCTCC GACTTCGCCG CGGCGGCCGA GCCGGTGCGG TGGACCTTGC TGGCGGCGCT GTGCTGGGTG CGCAAGACCG AGCTGACCGA CGGCCTGGTG GAGCTACTGA TCCAGCTCGT GCACAAGATC AGCGTGCGGG CCGAACGCAA GGTCGAAAGC GAGATCAGCG CCGAGTTCCG GCGGGTGCAC GGCAAGAACG GCATTCTGGT TCGGCTCGCC CAGGCCGCCC TGGACCTGCC CGAGGAGGTC GTGCGCGAGG CGATCTACCC GGTGGTGGGG GCGCGGACGC TGGCAGACAT CGTCGCCGAG GCCAAGGCGA ACGAGAAGGT GTTCAACTCG CGGGTGCGCA CCAAGCTGCG CGGCTCCTAC TCGCGCCACT ACCGGCGCGG GCTACCTAAG CTGCTGCGCG CGCTGGAGTT CGGCTGCTCC AACACCGCCT TCCGGCCGGT GATGGACGCC CTGGCGCTGC TGGACCGGTA CGCCGACTCC GAGGCGGTGC ACTACGACCG CAGCGAGACC GTCCCGCTGG AGCACGTGGT GCCCGACGAT TGGAAGGACG CGGTCGTCGA CCCCGACACC GGCCGCGTGG AGCGCATCCC GTACGAGCTG TGCGTGCTGG TCGCCCTGCA CAAGGCGATC CGCCGCCGGG AGATCTGGAT CACAGGTGCC AAGACGTGGC GCAACCCCGA CGACGACCTG CCCGCCGACT ATGAGAACAA CCGGGACGTG CACTACGAGG CGTTGTCCAA GCCCCGTGAC CCGGCGGCGT TCATCGCCGA CCTGCAACGC CGGCACCTGG CCGCCCTGGA CCGGCTCAAC ACCGCGATGG GCTCCGACAC CACCGGCGGA GTGAAGCTGA CGCGACGCAA AGGCGAGCCG TGGATCTCGG TCCCGCCCCT GAACCGCCGT CCCGAGCCGA CGGGCCTGGT GGCGCTGAAG GAGGAGATCT CGCGGCGCTG GGGCGTGATC GATCTGCTGG ACGTGCTCAA GGACGTCGAC CACGTCACCG GGTTCACCAA GGAGTTCACC 
TCGGTGGCCT CCCGCACCAT CACCCATCCT GACGTGCTGC AACGCCGGCT GCTGCTGTGC CTGTACGGGC TGGGCACCAA CGTGGGCATC AAGCGGGTGG CCGACGGCGC GGTCGCGGCC GGGCTGGAGG ACAGCGAGGC GGTGCTGCGC CGGATCCGGC GGCTGTTGAT CCGCGGATGA
|
Protein sequence | MRREWELDEL IECWTLDEDE LGLLSNKSGA TRLGFALLLK FFELEGRFPR REDVPKAAVE YMAGQVGVEA ALLFADYQWS GRTIEYHRSQ VRKHHGFRQP TVADEDKQIF WLAGELCPEE LSRDRLRAAL LTRFRAEKIE PPTPVQVDRL LGAAVAMFER DFTTRTAQRL GSAAGARLDD LIAAPEVEAD PVEVPVAAPG RSFLQELKED PGPIQLDTLL AEIVKLERVR AIGLGEGLFE GVSEKIVESW RARAMRMYPS DFAAAAEPVR WTLLAALCWV RKTELTDGLV ELLIQLVHKI SVRAERKVES EISAEFRRVH GKNGILVRLA QAALDLPEEV VREAIYPVVG ARTLADIVAE AKANEKVFNS RVRTKLRGSY SRHYRRGLPK LLRALEFGCS NTAFRPVMDA LALLDRYADS EAVHYDRSET VPLEHVVPDD WKDAVVDPDT GRVERIPYEL CVLVALHKAI RRREIWITGA KTWRNPDDDL PADYENNRDV HYEALSKPRD PAAFIADLQR RHLAALDRLN TAMGSDTTGG VKLTRRKGEP WISVPPLNRR PEPTGLVALK EEISRRWGVI DLLDVLKDVD HVTGFTKEFT SVASRTITHP DVLQRRLLLC LYGLGTNVGI KRVADGAVAA GLEDSEAVLR RIRRLLIRG
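
The "GC content" field can in principle be recomputed from the gene sequence shown above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the whole sequence; the helper name `gc_percent` is hypothetical. It ignores the spaces that separate the 10-base blocks, and it is demonstrated only on the first block of the gene sequence, not on the full 1980 bp.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G/C bases, ignoring whitespace between sequence blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence in this record.
first_block = "GTGCGTCGTG"
print(gc_percent(first_block))  # 70.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the reported 69%.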