Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0346 |
Symbol | |
ID | 5708018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 383168 |
End bp | 386074 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641269872 |
Product | transposase Tn3 family protein |
Protein accession | YP_001535267 |
Protein GI | 159036014 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00623467 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCGACTC AGGACGTGTT CTCGGCGGAG GAACTGTCGC GGCTTCGGGG CTTCCCGGAG ATCAACCGGG CCGAGCTGGT CCGATACTTC ACGTTGACCG GTGCGGACGA GGCGTTCGTG CGCCGGTTCC GCATGGGCCG CAACGTGTTG GGCGTGGCGG TCCAGCTGTG CACGTTGCCG TGGCTGGGGT TCGTGCCCGA TGAGGTGCCG ACGGCACCGG CGGCCGTGGT GGGCCGGTTG TCGCAGCAGC TTGGGATCGC GATGGGTGAG TTGCGCGGGT ACGGCGAGCG GGAGCAGACC CGGACGGATC ATCTGCGCGA GGTGGCCGGC TACGCCGGCT GGCGGTCGAT GGGCGCCGCC GAGTGGAAGG ACCTGGACGA GTTTCTGTTC GCCCGGGCGA TGGAGCACGA TTCGCCGAAG CTGCTGTTCC GGTTGGCCTG CGAGTACCTG CTGTCGTCGC GGGTGATCCG GCCGGGGGTG ATCCTGCTGC TGCGGCGGGT GGCCGCGGCG CGGGCCCGGG CGCGGACGCA GACGTGGGCG CGGGTGCGGC ATCTGCTGAC CGACCGGCGG TGCGCGGAGC TGGACTTGCT GCTGGTTCCG GACGCCAATC TCGGCCGGAC GCCACTGGCT TGGCTGGGTG TGGGGCCGTC CTCATCGAGC CCGGCCGCGG TCAGGGCAGA GCTGGAGAAG CTGGCCTACC TGCGCCGCCT GGACGCGCAT ACCTTGGACC TGTCGATGCT GCCGGCCGAG CGGCGCCGGT TTCTGGCCGG GGTGGGTCGC CGGTTGACCG GGCAGGCGTT GCAGCGGCGC GAGCCGGAGC GCCGGTACCC GATCCTGCTG ACGCTGCTGG CGCAGTCGGC GGTCGACGTG CTCGATGAGA CGCTGCTGTT GTTCGACCAG GCGATCAGCG GGCGGGAGGC GGCGGCGGAA CAGAAGGTCG CAGCGGCCCT GGCCGAGCGG GCGAAGGGCG GGGAGAACCG GCAGGCGCTG CTGGACGAGA TCCTGACGAT CGTGCTCGAC ACCGGGATCG GCGACGAGCA GGTCGGCACC CTGCTGCGCA CCAGCGTCGG CCTGGACCGG ATGCGCGCGG CGTGGGCCGA GCGGCGCGAG CGGCTGCCCC GCGACCACGG GCAACTCAGC ATGATGGACG CGTCGATGTC GTATCTGCGG CAGTTCGCCC CGGCCGTGCT GGCGGCGGTG CGGTTCGCCG GCGGGCCCGG CACCGAGCAG CTGCTGCAGG CGGTCAGCCT GCTGGCCGGG CTGTACGCCA CTGGCGCCCG TAAGGTTCCG GCCGGTGCTC CGGTCGGGTT CGTGCCGGCG ACATGGGCCG GCTACCTGGT CGCCGCGGAG CAGGCCGGCG ACGTCACCGC CTACCGGCGG TACTGGGAGC TGGCCGTGCT GGTCGGCCTG CGCGACGGGC TGCGCTCCGG CGACGTGTTC GTACCCGGGT CGCGCCGGTA CGCCGATCCG GCGTCGTTTC TGCTCACCCC CGAGGCGTGG GCGCCGCAGA GGGTGGAGTT CTGCCACTTG GTAGGCAAGC CGGTCGAAGC CGTCGACGCG CTGGTCCAGG CCGACGAGGA GTTGCACATC GCGCTGGCGG ATCTGGAGTC GCAGCTGGCG AAGGGCGACC CGGGTGAGGT CCGGCTCACC GACGACGGGG AGCTGATCAT CCCGCCGCTG ACTGCCGAGG ATGTGCCCGC CGAGGCCGAC GCGGTGCGCG CGGAGTTGGC CGGGATGCTG CCTCGGGTGC CGATCGCGTC GGTGCTGGTG GAGGTCGACG CACGGACCGG GTTCACCGAG CACCTGGTGC ACGCCGGCGG AAAGGTGAAC CGGCCGGCTG AGCTGAAACG CAACCTGCTG TACGTGCTCA TCGCGGAGGC CACGAACATG GGCTTGTCGG CGATGGCGGA GTCCTGCGGC GTGCCGTACG ACATGCTCGC GTGGACCGCG GAGTGGTACT TCCGGCCGGA GACTCTGGAG GCCGCGAACG CCGCAGTGGT CAACTACCAC CACCGGCTCC CGTTCACTCA GGCGTTCGGG GCGGGCACCC TGTCGTCCTC GGACGGGCAG CGGTTCCCGG TCAAGGGCAG GAGCATCACC GCCCGGCACC TGTCCCGGTA CGTCCCCCGC GGCCAGGGTG ACTCCACCTG TACCCACGTC TCCGACCAGC ACTCGACCGT CGACCCGAAG GTCATCGTGG CGACCGCGCC GGAGGCGCAC TACGTGCTCG ACGGGCTTCT GGGCAACGCC ACCGACCTAC CCGTGTCCGA GCACGCCACC GACACCCACG GCGCCACCCT GGCCAACTTC GCCCTGTTCG ACCTGGTCGG CAAGCAGCTT TCGCCGCGCA TCCGCGACCT CGGGAAGATC ACCCTCTACC GGACCGGCCC GAACGCCGAC GTCCTCGCCC GCTACCCGCG CGCCGGCGGC CTGCTGACCC GACGCCTGAA CACCGACCTG ATCACGAGCA CCTGGGACGA TCTGCTGCGC GTCGCCGCCT CGGTGCAGGG CGGCCACGCC ACCGCAGCGC TGGTCGTCGG GAAGCTGTGC TCCTCGAAAC GGCAGCAGAA TGCGCTGACC AGCGCGATCA AGGAATACGG GGCGCTGCGT CGCACGGTCT ACGCCGCCAG GTATCTGGCC GACGAGACCT ACCGGCGGCG GATCTCCCGG CAGCTCAACA AGGGCGAGAA CCTGCACGCC CTGCGCCGCT GCCTGGCCTA CGCCGGCGAG GGAGCGCTGC GCCGCCACCA CGAGCAGCAG ACCGAACAGA TGTGGTGCCG CACGCTGGTC ACCAACGCAA TCGTCTGCTG GTCCACTGAG TACCACAGCC TCGCTGCCGG TGCGCTACGC CGCGACGGCC GTCAGGTCGA CGACGAAGTC CTGGCGCGCC TCCACCATCC CACATGA
|
Protein sequence | MATQDVFSAE ELSRLRGFPE INRAELVRYF TLTGADEAFV RRFRMGRNVL GVAVQLCTLP WLGFVPDEVP TAPAAVVGRL SQQLGIAMGE LRGYGEREQT RTDHLREVAG YAGWRSMGAA EWKDLDEFLF ARAMEHDSPK LLFRLACEYL LSSRVIRPGV ILLLRRVAAA RARARTQTWA RVRHLLTDRR CAELDLLLVP DANLGRTPLA WLGVGPSSSS PAAVRAELEK LAYLRRLDAH TLDLSMLPAE RRRFLAGVGR RLTGQALQRR EPERRYPILL TLLAQSAVDV LDETLLLFDQ AISGREAAAE QKVAAALAER AKGGENRQAL LDEILTIVLD TGIGDEQVGT LLRTSVGLDR MRAAWAERRE RLPRDHGQLS MMDASMSYLR QFAPAVLAAV RFAGGPGTEQ LLQAVSLLAG LYATGARKVP AGAPVGFVPA TWAGYLVAAE QAGDVTAYRR YWELAVLVGL RDGLRSGDVF VPGSRRYADP ASFLLTPEAW APQRVEFCHL VGKPVEAVDA LVQADEELHI ALADLESQLA KGDPGEVRLT DDGELIIPPL TAEDVPAEAD AVRAELAGML PRVPIASVLV EVDARTGFTE HLVHAGGKVN RPAELKRNLL YVLIAEATNM GLSAMAESCG VPYDMLAWTA EWYFRPETLE AANAAVVNYH HRLPFTQAFG AGTLSSSDGQ RFPVKGRSIT ARHLSRYVPR GQGDSTCTHV SDQHSTVDPK VIVATAPEAH YVLDGLLGNA TDLPVSEHAT DTHGATLANF ALFDLVGKQL SPRIRDLGKI TLYRTGPNAD VLARYPRAGG LLTRRLNTDL ITSTWDDLLR VAASVQGGHA TAALVVGKLC SSKRQQNALT SAIKEYGALR RTVYAARYLA DETYRRRISR QLNKGENLHA LRRCLAYAGE GALRRHHEQQ TEQMWCRTLV TNAIVCWSTE YHSLAAGALR RDGRQVDDEV LARLHHPT
|
| |