Gene Information |
Locus tag | Strop_4138 |
Symbol | |
ID | 5060622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 4705943 |
End bp | 4707478 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640476400 |
Product | transposase, IS204/IS1001/IS1096/IS1165 family protein |
Protein accession | YP_001160945 |
Protein GI | 145596648 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
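The coordinate and length fields above are mutually consistent, and this can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (which the listed values imply) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4705943, 4707478)
print(length, protein_length(length))  # 1536 511
```

The printed values match the Gene Length (1536 bp) and Protein Length (511 aa) fields of this record.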
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.609635 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGGACG TCAATAGGCT TGTGCAGGCG GTGTTCTCGG GGCTGTCCCC GCTGGTCATC GAGGACGTGG TCGACGAGGG TGAGCGGATC GTGGTGCGGG CCCGGACGCC GCAGGACACC GCGGTCTGCC CGGTGTGCGG GGCCTCGTCG GGAAGGGTTC ACGGCTATCA CTGGCGGACG GTGGCCGATG TTCCGGTCGA CGGGCGGCGG GTGGTGGTCC GTGTGCGGGT GCGGCGTCTG GTGTGCCCCA CGCGCGGCTG CCACCACACC TTCCGCGAGC AGGTTCCCGG GGTGCTGGAG CGATACCAAC GCCGCACTGT CCGTTTGAAC AGGCAGGTCA AAGCCGTCGT CAAGGAGTTA GCGGGTCGGG CAGGGTCGCG TTTGCTGGCG ATACTGGCCA TGGGCCTGTC CCGTCACACC GCCCTGCGCG CCCTGCTGCG CATCCCGTTG CCCACCGGGC GGACGCCGCG GGTGATCGGC GTTGACGATT TTGCTCTGCG CCGGCGGCAC CGCTATGCCA CCGTGGTGAT CGACGCCGAG ACCCATGAGC GGATTGAGGT GCTGCCCGAC CGCACCGCCG ACACCCTCGA AGCGTGGCTG CGTGAGCATC CCGGCGTCGA GGTGGTGTGC CGCGACGGCT CGGCTACCTA CGCCGAGGCC ATCCGCCGTG CGGTGCCCGA TGCGGTGCAG GTTGCGGACA GGTGGCATCT GTGGCACAAC CTCTGCGAAG CGACCTTGAG CGAGGTGAAG GCGCACAGCT CCTGCTGGGT GACCGTACTG GACGCGCCCA TCTACGACGG GCCCCGCGCG CAGACGACCC TGGAACGCTG GCATCAGGTC CATGACCTGC TCGATCAGGG CGTCGGCCTA CTCGAATGCG CCCGTCGCCT GCAGTTGGCT CTGAACACCG TCAAACGCTA CGCGCGAGCC GACCGACCCG AGCGGATGCT CCGCGTCCCC AAATACCGTG CCGGCCTCGT CGACCCCTAC CGTGAACACC TGCGCAAACG TCGAGCCGAG GACCCCGGCG TCGGCGTCAA GCACCTCTTC GAAGAGATCA AGGCACTCGG GTTCACCGGC TGTCTGAACC GGCTGCACAA GTACATCAAC CAGGGCCGCG CCGACGCAGA CCGCAGCCAC ATCTCCCCGC GCCGGCTCGC CCGGATGATC CTCACCAGGC CCGGCAACCT CAAACCCGAG CACCGTGATC TCCTGGCACG GCTCACCGCC GCCTGCCCAG AGATGACCCA ACTGGCCGCC GCAGTCGGAC GCTTCGCCGC ACTCCTGACG CCACAGCCGG GAAACGCCGA CCGGCTCTCG CTCTGGATCG TCCAGGTCCG CGCGGTCGAC CTACCTCATC TGCACGCCTT CACCCGAGGC CTGGAACGCG ACCGCGACGC CGTGAACGCC GCGCTCACGC TTCCCTACAG CAACGGCCCC ACCGAAGGCG TCAACACCAA GACCAAACGG ATCGCACGCC AAATGCACGG ACGAGCAGGC TTCACCCTGC TCCGCCACCG CATCCTCCTC GCATAG
|
Protein sequence | MEDVNRLVQA VFSGLSPLVI EDVVDEGERI VVRARTPQDT AVCPVCGASS GRVHGYHWRT VADVPVDGRR VVVRVRVRRL VCPTRGCHHT FREQVPGVLE RYQRRTVRLN RQVKAVVKEL AGRAGSRLLA ILAMGLSRHT ALRALLRIPL PTGRTPRVIG VDDFALRRRH RYATVVIDAE THERIEVLPD RTADTLEAWL REHPGVEVVC RDGSATYAEA IRRAVPDAVQ VADRWHLWHN LCEATLSEVK AHSSCWVTVL DAPIYDGPRA QTTLERWHQV HDLLDQGVGL LECARRLQLA LNTVKRYARA DRPERMLRVP KYRAGLVDPY REHLRKRRAE DPGVGVKHLF EEIKALGFTG CLNRLHKYIN QGRADADRSH ISPRRLARMI LTRPGNLKPE HRDLLARLTA ACPEMTQLAA AVGRFAALLT PQPGNADRLS LWIVQVRAVD LPHLHAFTRG LERDRDAVNA ALTLPYSNGP TEGVNTKTKR IARQMHGRAG FTLLRHRILL A
|
| |
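The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of several alternative initiation codons, and an initiation codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. It assumes the sequence is given on the coding strand, as it appears in this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and written for this example only.

```python
from itertools import product

# NCBI-style codon table layout: bases in TCAG order, third position varies fastest.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


print(translate("GTGGAGGACG TCAATAGGCT TGTGCAGGCG"))  # MEDVNRLVQA
```

The output matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.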