Gene Information |
Locus tag | Sros_3614 |
Symbol | |
ID | 8666902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4008701 |
End bp | 4011730 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | transposase Tn3 family protein |
Protein accession | YP_003339289 |
Protein GI | 271965093 |
COG category | |
COG ID | |
TIGRFAM ID | |
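
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and not part of any database API.

```python
# Consistency check for the coordinate and length fields of this record.
# Assumption: Start bp and End bp are 1-based, inclusive positions.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4008701, 4011730)
print(length)                  # 3030
print(protein_length(length))  # 1009
```

Both values match the Gene Length (3030 bp) and Protein Length (1009 aa) fields listed above.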
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGTG TGCTGGACGA GGACGAGCTC GTTGGGAACT GGACGCTGGT CGGGGATGAG CTGGATCAGC TGTCAGGCCG GCGGGGCGTG ACCAAGCTCG GGTTCGCGCT CCTGCTGCGG TTCTATGCGC TGAACGGCCG GTTCCCGACG GGCCGCGCCG AGCTTCCCGA TCAGGCCGTC GCCTACGTCG CCCGGCTGGT CGACGTACCC GCCTCAGAGC TGGGGCTGTA TGAGTGGGAC GGCCGGACGA TCAAGGATCA CCGCAAGGAC ATCCGGAAGT ACTTCGGGTT CCGCGAGTGC TCTCTCGCTG ATTCCGACAA GGCCGCGGAC TGGCTGGCGG CCCAGGTCTG CCTGAAGGAG CGGCAGGTCG ACCGCGTCCG CGCCGAGCTC CTGGCGCATC TGCGCCAGGA GCGGATCGAG CCGCCGGCGC GGGACCGGAT CCGCCGGATC ATCGGGACAG CGCTGCGGCA GGCCGAGCAG ACGCAGACCT CCCGGATCTC CAGCCGGATC CCTGCAGAGG CCGTCCCGCA GCTGCTCGCC CTGATCGCCA AGAGCACCGA TCCCGGTGAC GGACCCGAAG ACCAGGACGA GGACGACGGC GCGTTGTTCG GCGCCGCCGA GGCCGCCGCG GTGGACGTGT TCGCGGTGAT CCGCGAGGAG CCGGGCAACG TGAGCGTGAA GACGATCGAG CGGGAGGTGT TCAAGCTGAC CGCGATCGGC AAGGTCGGCC TGCCGGACAA CCTGTTCGCC GACGTCGCTC CGAAGGTGCT GGCAGCCTGG CGGGCTCGGG TCGCGGCCGA GGCGCCGTCG CACCTGCGCT CGCACCCGCA CGACGTCAAG GTGACGTTGC TGGCCGCCTA CTTGTACTGC CGGTCCCGTG AGATCACCGA CACTCTGGTC GACCTGCTGA TCGCGACGGT GCATCGGATC AACGCCCGCG CCGACACGAA GGTCATCAGT GACTTCGTGG CGGAACTGAA GCGGGTGTCG GGCAAGGAGA ACATCCTGTT CAAGATGGCC GAGGCCGCGC TGGAGTCGCC GCAGGCGCGG GTGGAGGACG TCATCTACCC GGCGGTGCCG GGCGGCTACA AGACGCTGGT CCAGCTGCTG CACGAGTACA GGGCGAAGGG CACCTCCTAC CGGCAGCACA AGCAGCGGGT GTTCAAGGCC TCCTACACCA ACCACTACCG GATCGGCCTG ATCCAGATCC TGGAGGTGCT GGAGTTCGGG TCGACGAACA CCGTGCACAC GCCCATGGTG CAGGCCCTCA CCTTGATCAA GCGGTACAAG GCCGAACAGT CCAACCGGAT CAAGTGCTAC GCCCTCGGCG AGCACGTCCC CGTCGACGGG ATCGTCCCGG CCGAGCTGGT GGAGCTGATG TACCGGGCCG ACAGCACCAA GCGGCAGCGG ATCCTGCGCA CCGTGTACGA GTGCGGCGTC TTCCAGTCCC TGCGCGAGAA GCTGCGCTGC AAGGAGATCT GGGTCCACGG GGCGGACCGG TGGCGCAACC CCGACGACGA CTTGCCCAAG GACTTCGAGG CCAATCGCAC CGAGAACTAT GCCAAGCTCC GCAAGCCGCT GGACCCGCAG GTGTTCGTCG ATCAGCTGCG CGAGGAGATG GACACCGAGC TGTCGGCGCT GAACGACGCC CTAGGTGGCA AGGGCCTCAC GTGGTTGAAG ATCGCCGAAC GGCGCAACGC CGGGGCGATC CACCTCACCC CGCTGGACGC TGCCCCAGAG CCGCGCAACC TGCGCCGACT GAAGGCCGCG ATCCGGGACC GGTGGGGCGT CGTCCCGCTG 
ATGGACATGC TCACCGAGAC CGCCCTGCGG ACCGGCTGCT TGAACGTCTT CACCCCGGCC GGAACGCAGA ACCACCTGGA CCCGGCCGTG TTGTTCGAGC GGCTCCTGCT GCTGATCTAC GCCTACGGGA CGGGGACCGG GATCCGCGCG GTCGCCGCCG GCGACCACCC GCACACCGAG GACGACCTGC GGTACGCGCG CCGCCGCTAC CTGACCGTCG AGGCCTGCCG CGAGGTCGCC CGGGTCATCG CCAACGCGAC CTTCGCCGTC CGGCAGGCTG CCCTGTGGGG CGTGGGCACG ACCGCGGTCG CCTCGGACTC CACCCACTTC TCAGCTTTCG ATCAGAACAT CTTCACCGAG TGGCACTCCC GCTACCGGCG GGCCAAGCGC GGGGTGCTGA TCTACTGGAC GGTCGAGGTA GGTGGGTCGA TGGCCGTGCA CAGCCAGCTC ATCAGCTGTT CAGCCTCCGA GGTCCACGCG ATGGTCGAGG GCGCGATGCG GCACGGCACC GACATGGATA TCGAGCAGAA CTTCGTCGAC TCCCACGGCG CCAGCTTCGT TGGGTTCGGG ATCACCAGGC TGCTGGACTT CGACCTGGTC GCCCGGTTCA AGCAGATCAA CAAGATGAAG CTGTACGTGC CAGGCCGCGG CGAGGACTTC TCCTACCCGC TCCTCAGCCC CGCGCTGACC AGGCCAATCC GGTGGGACAT CATCACCCAG AACTACGACA TCATGATGAA GTACGCCACC GCGATCCGGC TGCGGACCGC GTCCACCGAG GCCCTGTTGC GCCGATTCAC CAGCGAGACC ACCCACCCGG CCTACGCCGC GATGCTAGAG GTCGGCCGCG CCCAGCGCAC GATCTTCCTC ACCCGCTGGC TCCGCGACCG CGATCTCCAG CGGGAGACCG AATCCGGATT GAACGTGGTG GAGAATTACA ACGGCGTCAA CGACTACATC AAGTTCGGCA AGCGCGGCGA ACTCGCCTCC AACCGGCGCG AAGAGCAGGA GCTGGGGATG CTGTGCCTGC ACATCCTCCA GTCGGCCCTC GGCCTGATCA ACACCCTGAT GATCCAGGAC ACCCTCGCGC TACCCGAGTG GGAGAACGTC CTGACCGACG CCGACCGGCG CGGCCTCACC CCGCTGTTCC ACACCAACAT GACGCCCTAC GGCGAGATCC AGCTGCGCAC GGACCGGCGC CTCGACCTCA CCGACCTGCC GACCGCGTAG
Protein sequence | MARVLDEDEL VGNWTLVGDE LDQLSGRRGV TKLGFALLLR FYALNGRFPT GRAELPDQAV AYVARLVDVP ASELGLYEWD GRTIKDHRKD IRKYFGFREC SLADSDKAAD WLAAQVCLKE RQVDRVRAEL LAHLRQERIE PPARDRIRRI IGTALRQAEQ TQTSRISSRI PAEAVPQLLA LIAKSTDPGD GPEDQDEDDG ALFGAAEAAA VDVFAVIREE PGNVSVKTIE REVFKLTAIG KVGLPDNLFA DVAPKVLAAW RARVAAEAPS HLRSHPHDVK VTLLAAYLYC RSREITDTLV DLLIATVHRI NARADTKVIS DFVAELKRVS GKENILFKMA EAALESPQAR VEDVIYPAVP GGYKTLVQLL HEYRAKGTSY RQHKQRVFKA SYTNHYRIGL IQILEVLEFG STNTVHTPMV QALTLIKRYK AEQSNRIKCY ALGEHVPVDG IVPAELVELM YRADSTKRQR ILRTVYECGV FQSLREKLRC KEIWVHGADR WRNPDDDLPK DFEANRTENY AKLRKPLDPQ VFVDQLREEM DTELSALNDA LGGKGLTWLK IAERRNAGAI HLTPLDAAPE PRNLRRLKAA IRDRWGVVPL MDMLTETALR TGCLNVFTPA GTQNHLDPAV LFERLLLLIY AYGTGTGIRA VAAGDHPHTE DDLRYARRRY LTVEACREVA RVIANATFAV RQAALWGVGT TAVASDSTHF SAFDQNIFTE WHSRYRRAKR GVLIYWTVEV GGSMAVHSQL ISCSASEVHA MVEGAMRHGT DMDIEQNFVD SHGASFVGFG ITRLLDFDLV ARFKQINKMK LYVPGRGEDF SYPLLSPALT RPIRWDIITQ NYDIMMKYAT AIRLRTASTE ALLRRFTSET THPAYAAMLE VGRAQRTIFL TRWLRDRDLQ RETESGLNVV ENYNGVNDYI KFGKRGELAS NRREEQELGM LCLHILQSAL GLINTLMIQD TLALPEWENV LTDADRRGLT PLFHTNMTPY GEIQLRTDRR LDLTDLPTA
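
The GC content field can in principle be recomputed from the gene sequence. The sketch below is a minimal illustration, assuming the sequence is given as plain text in the space-separated 10-base groups used in this record. The helper name `gc_content` is hypothetical, and the example input is only the first two groups of the gene sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base groups of the gene sequence, for illustration only.
example = "ATGGCGCGTG TGCTGGACGA"
print(f"{gc_content(example):.0%}")
```

Passing the full gene sequence instead of the short example gives the value to compare with the GC content field.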