Gene Information |
Locus tag | Sare_3106 |
Symbol | |
ID | 5706580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3529981 |
End bp | 3531336 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641272539 |
Product | transposase IS4 family protein |
Protein accession | YP_001537907 |
Protein GI | 159038654 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.99267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0276245 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATGAGA TTGCCATAGC CCGGACGGTC ACGGTAGCTG CCGGGTCGTT CGCGGCGGGT CACCTCGGTG AGCTGACCCG CCTCGTGCCG TTCGAGATGA TCGATGACGT GCTGGCTGCG ACCAGGCGCA CACAGCGACG TGTCCGCCTG TTGCCGGCCC GGGTCGTGGT GTATCTGCTG CTGGCCGGCT GCCTGTTCGC CGATTGCGGC TACCGGCAGG TGTGGGCGAA ACTCGTCGCC GGTCTGCGCG GGCTGCCAGT CGCCGATCCC AGCGACAGCG CGTTGCGGCA GGCCCGGCAA CGACTCGGCC CGGCCCCACT GCGGGCCTTG TTCGACCTGC TGCGCGGCCC CGCGGCCACC AGTGCTGTCG CTGCGGTGCG ATGGCGGGGA CTGCTACCGG TCGTCGTCGA CGGCACCATG ATCGCTGTCG CGGACTCGCC GGCCAACCTG GGTCGCTACG GCAAACACCG GTGCAACAAC GGCGGCTCCG GCTACCCCAC ACTGCGGCTG AGCGCCCTGT TGACGTGCGG CACCCGCTCG GTCATCGACG CCGTGTTCGA CCCGAGCACC ACCGGCGAGA TCACCCAAGC CCACCGGCTG ACCCGCAGCC TGCGCGCCGG AATGCTGCTG CTGGCCGACC GCAACTACGC CGCCGCCGAC CTAATCGGCG CGTTCACCGC CACCGGAGCG GACCTGCTGA TCCGCTGCAA GAGCGGCCGG AAACTCCCGA TGACCCGCCG CTGTCGAGAC GGATCCTGGC TGTCGGTCAT CGACGGCCAG CCGGTGAGGA TCATCGAGGC CCGGATCAGC ATCACCACCA CGGCCGGCAG CCACACCGGC GACTACAGGC TCATCACCAC CCTGCTCGAC CCACGTCGCT ACCCCGCCGC CGACCTCGTC CGCCTCTACC ACCAGCGGTG GGAAATCGAG ACCGCCTACC TGGAACTGAA GTCCACCATC CTCGGCGGCC GGGTGCTGCG CGCCCGCACC CCCGACGGCG TCGACCAGGA GATCCACGCC CTGCTCATCG TCTACCAGGT GCTGCGCACC GCCATGGTCG ACGCCACCGA CAGCCGGCCC GGCCTCGACC CGGACCGGGC CAGCTTCACC ACCGCCCTGC ACGCCGCCCG CGACCAGATC ACCCAGGCCG CCGGCATCAT CGCCGACACC GTCATCGACC TGGTCGGCGC CATCGGTGAA CGCGTCCTGA CAGACCTGCT GCCCGACCGT CGCATCCGAT TCAAGGCCCG CATGATCAAA CGCTCGAACT CCAGGTACCA GGCCCGCGGA CCCCGGATCG ACCGCCGAAC CTACAAGGCC ACCACCAGCA TCGACGTCAT CACCAACGAC CCTTGA
|
Protein sequence | MDEIAIARTV TVAAGSFAAG HLGELTRLVP FEMIDDVLAA TRRTQRRVRL LPARVVVYLL LAGCLFADCG YRQVWAKLVA GLRGLPVADP SDSALRQARQ RLGPAPLRAL FDLLRGPAAT SAVAAVRWRG LLPVVVDGTM IAVADSPANL GRYGKHRCNN GGSGYPTLRL SALLTCGTRS VIDAVFDPST TGEITQAHRL TRSLRAGMLL LADRNYAAAD LIGAFTATGA DLLIRCKSGR KLPMTRRCRD GSWLSVIDGQ PVRIIEARIS ITTTAGSHTG DYRLITTLLD PRRYPAADLV RLYHQRWEIE TAYLELKSTI LGGRVLRART PDGVDQEIHA LLIVYQVLRT AMVDATDSRP GLDPDRASFT TALHAARDQI TQAAGIIADT VIDLVGAIGE RVLTDLLPDR RIRFKARMIK RSNSRYQARG PRIDRRTYKA TTSIDVITND P
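
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence shown ends in TGA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 3529981, End bp 3531336.
length = gene_length_bp(3529981, 3531336)
print(length)                     # 1356
print(protein_length_aa(length))  # 451
```

Under these assumptions the results agree with the Gene Length (1356 bp) and Protein Length (451 aa) fields listed in the record.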