Gene Strop_4134 details


Gene Information

Locus tag: Strop_4134
Symbol:
ID: 5060618
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 4699603
End bp: 4702689
Gene Length: 3087 bp
Protein Length: 1028 aa
Translation table: 11
GC content: 68%
IMG OID: 640476396
Product: transposase Tn3 family protein
Protein accession: YP_001160941
Protein GI: 145596644
COG category: [L] Replication, recombination and repair
COG ID: [COG4644] Transposase and inactivated derivatives, TnpA family
TIGRFAM ID:
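
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4699603
END_BP = 4702689
GENE_LENGTH_BP = 3087
PROTEIN_LENGTH_AA = 1028


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is self-consistent: the coordinate span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.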


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.160281
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGTCGA TCGAGCGGAC CGCGTATCCG CGGTTCAAGC GGTTTCTGTC GGCCCGGGAA 
TTGCACGTGT TCTACACGCC GCAGCCGGAG GAGATCGCGT GGACGAGCGG GCTGGTGCGC
TCGGACAGTC ATCTGCTGGC GTTCATGGTG CAGCTGAAGT GCTTCAACCG GATGGGGTAC
TTCCCGCGGC TGGATGAGGT CCCGGAGGCG GTGGTGGCCC ACATCCGGCG GGATCTGGGC
CTGGGTGAGG ACGTCGCCGC GGTGTACGAC TCGGAGCGGA CCTGGGGCCG TCATCGGCTG
CTGATCCGCC GACGTAGTGA AGTCGTGTCG GACATGCCGG CGGCCCGGGC GGTGGCCGCC
GCGGCGATCC GGGAGGCTGC CGGGCGTAAG AACGATCCGG CCGACCTGAT CAATGTGGCG
TTGGAGAAGC TGGTCGAGGG CTCGTTCGAG CTGCCGGGTT ACACGACGTT GGACGAGATG
GCCTCGGCGA TCCGCGAGGA GGTCAACTCG GCGATCTTCG CGCTGGTCGT CGAGCGCATC
GGTCCGGCAG GTGTGGCCGG GTTGGATCGG ATGCTGATTA CGGCGGGTGG TCCGGGGAGC
AAGAGCGACT ACAACCGGTT GAAGCGGACC GCGCCGCGGC CGTCGTGGAC GAACTACCGG
CTGCAGATCG AGCATCTGCG CTGGGCCGAC AGTCTTGGTG ACTCGCGGTC CTGGTGGGAG
GGCATCGCGC GATCGAAGAT CGCCGACTTT GCCGGGGAGG GCGAGGCCGG TGACGCCGCG
GTGCTGGGCG ACTACGGGGA CGCGAAGCGT ATCGCGATCC TGGCGGCGAT GGTTTACGCC
GCGCAGCAGC GGGCCCGCGA CGACACAGCG GAGATGTTCT GCCGGCGGGT CGGCACCCTG
ACCAAGCGGG CCCGGCTGGA GCTGGAGGAG CTGAAGAAGA AGCAGCAGAA GGTCACCGAG
GCGCTGATCG TCAACTACCG GCAGGTCCTG GAGCACCTCG ACCCGTACGG TCCGGCCGCG
GCCCAGCACG CCGCGACGCT GGAGATGGCC CGCAAGACAG TCGAGGCCGC GGGCGGTTTC
CCGGAGGAGC TGGCCCGCAT CGATGCGGTC CGGGCGACCC ACGGCGACAA CCATGTGCCG
CTGGTGGCCC GGCATTTCCG CAAGGACCGG TCGTCGATGC TGGCCATGGT GGGCGTCCTG
GACTTGGAGG CGACCAGCGC GGACCGCAGT GTGTTGCAGC TGCTGGACTA CATGCGTGAG
CACACGATGC TGACCCGCGA TCACATCCCT GACCGGATCT CGGTGTTAGA CGAGCAGGGC
CGGCCGGTGA CCTACCCAGA GACGGGTGAG CAGCGGATCC ACGTGTTTGA CACGTCGTTC
GCGTCGGAGA ACTGGAACAG GTCGATCCGG GACCGCAGCC GGCCCGGCAT GTTCGTACGC
CGGCACTTGG AGGCGTGCGT GCTGACGTAT CTGGCCGAGG AACTGCGGAC CGGCGACATC
GCTGTGACCG GCGCGCAGGC GTACGCGAAC TGGGCTGATC AGCTGCTGTC CCCGGACGAA
GTCGCCGCGA TGCTGCCGGG CTTCTGTGCC GAGGTCGGTA TCCCGGCGAC AGCCGCCGGG
TTCCGCGCGG ACCTGCACGA GCGCCTCGAC GCGCAGTGCC GGGCAACCGA CAGCGCGTAT
CCGGACCTGG CCGACTTCAC CATCGACGAG CTCGGCCGTC CATCGCTCAA GCAGCTGCGA
GCGGCGCCGC CCACACCGTC GGCGCAGGCC ATCGCGCTCG CCGTGCGGGA CCGGATGCCG
GAGCGCACGC TGATGGGGAT CCTGGCCCGC ACCGGGCACT GGCTGGACTG GTGGCGCCGG
TTCTCGCCCG TGTCGGGTTC GGATCCGAAG CTCAAAGACC CGTTCGTGCG CTACATCCTG
ACCACGTTCA CGTACGGCAC GAACCTGGGG CCGGCGCAGG CCGCCCGGCA CATCGCCGGG
GTCAGCGCCC ACGAGCTGGC CACCACGTCG GCGCGGCACG TCACGATCGG CAAGCTGAAC
AAGGCCATCG CCGACGTCGT CGATGCGTTC ACCGAGCTGG ACCTAATCAA GGTGTGGGGC
GACGGATCGG TGGTCGCCGC GGACGGTACC CAGGTGGACA CGTTCATCGA CAACCTCCTG
GCGGAGACGT CGATCCGGTA CGGCGGCACC GGCGGGATCG CGTACCACTA CGTGTCGGAC
ACCTACATCG CGTTGTTCTC CAAGTTCATC CCGGTCGGGG TGTGGGAGGC CGTGCACATC
ATCCAGGGCC TGCTCGACCA GCAGTCCAAG GTGCGGCCGG GCACGATCCA CGCCGACACC
CAGGGCCAGG CGCTGCCCGT CTACGCGCTC GCGCATTTGT GCGGGTTCGA GCTGATGCCG
CGGGTGCGTA ACTGGAAGGA TCTCAACTTC TACCGCACGT CGGCGGCCAC CCGGTTCCGG
CACATCGAGG CCCTGTTCGG CGAGCCCGGC CGCAACGTCA TCGACTGGGA CCTGATCGAA
CGCCACTACG ACGACCTGAT GCGGATCGTG CTTTCCGTCG CGGCCGGGAA GATCTCATCC
GTGACGTTGC TGCGCCGGCT GTCGACCTAC TCCCGGCGCA ACAACTTCTA CAAGGCCTTC
CGCGAGGTCG GCCGGGTCAT CCGCACGATC CAGCTACTGC GCTACCTGTC GGACCCGCAG
CTACGCCGGC GGACCACCGC GGCGACCAAC AAGGTCGAGT CCTACAACAA CTTCTCCGCC
TGGTGCCGAT TCGGCAACGA GGGCCGCGTC CGCGACAACG ACCCCGCCGA GCAGGAGAAA
CACATCAAGT TCTCCACCCT GCTGACCAAC GCGGTCATCT TTCACACCAC CCTGGACATG
ATGAGCGTGC TCCGGCAACT CGCCGGTGAG GGCTGGGAGA TCAAACCGGA GGACCTGGCC
GTGCTGTCGC CGTACCAGAC GATGCGGATC AACCGGTTCG GCGTCTACGC CACCGACGAG
ATCACCATCA CCCCCGAGCA GTACGACGCG CACCTACCCG ACATCGACCT CACCATCCCG
GAGCCCGTAC CGTCACCCGC CCGGTGA
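
A GC percentage such as the one listed in the gene information can be recomputed from a sequence printed in this layout. The sketch below strips the spaces and line breaks that separate the 10-base blocks and then counts G and C. Whether the database derived its 68% figure in exactly this way is an assumption, and `gc_content` is an illustrative helper name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above; paste the full sequence to
# compare against the GC content reported for the whole gene.
first_line = "GTGACGTCGA TCGAGCGGAC CGCGTATCCG CGGTTCAAGC GGTTTCTGTC GGCCCGGGAA"
print(round(gc_content(first_line), 1))
```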
 
Protein sequence
MTSIERTAYP RFKRFLSARE LHVFYTPQPE EIAWTSGLVR SDSHLLAFMV QLKCFNRMGY 
FPRLDEVPEA VVAHIRRDLG LGEDVAAVYD SERTWGRHRL LIRRRSEVVS DMPAARAVAA
AAIREAAGRK NDPADLINVA LEKLVEGSFE LPGYTTLDEM ASAIREEVNS AIFALVVERI
GPAGVAGLDR MLITAGGPGS KSDYNRLKRT APRPSWTNYR LQIEHLRWAD SLGDSRSWWE
GIARSKIADF AGEGEAGDAA VLGDYGDAKR IAILAAMVYA AQQRARDDTA EMFCRRVGTL
TKRARLELEE LKKKQQKVTE ALIVNYRQVL EHLDPYGPAA AQHAATLEMA RKTVEAAGGF
PEELARIDAV RATHGDNHVP LVARHFRKDR SSMLAMVGVL DLEATSADRS VLQLLDYMRE
HTMLTRDHIP DRISVLDEQG RPVTYPETGE QRIHVFDTSF ASENWNRSIR DRSRPGMFVR
RHLEACVLTY LAEELRTGDI AVTGAQAYAN WADQLLSPDE VAAMLPGFCA EVGIPATAAG
FRADLHERLD AQCRATDSAY PDLADFTIDE LGRPSLKQLR AAPPTPSAQA IALAVRDRMP
ERTLMGILAR TGHWLDWWRR FSPVSGSDPK LKDPFVRYIL TTFTYGTNLG PAQAARHIAG
VSAHELATTS ARHVTIGKLN KAIADVVDAF TELDLIKVWG DGSVVAADGT QVDTFIDNLL
AETSIRYGGT GGIAYHYVSD TYIALFSKFI PVGVWEAVHI IQGLLDQQSK VRPGTIHADT
QGQALPVYAL AHLCGFELMP RVRNWKDLNF YRTSAATRFR HIEALFGEPG RNVIDWDLIE
RHYDDLMRIV LSVAAGKISS VTLLRRLSTY SRRNNFYKAF REVGRVIRTI QLLRYLSDPQ
LRRRTTAATN KVESYNNFSA WCRFGNEGRV RDNDPAEQEK HIKFSTLLTN AVIFHTTLDM
MSVLRQLAGE GWEIKPEDLA VLSPYQTMRI NRFGVYATDE ITITPEQYDA HLPDIDLTIP
EPVPSPAR
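
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal hand-rolled translator, not the tool used by the database. It assumes that translation table 11 shares the standard codon-to-amino-acid assignments, and that an initiator codon such as the GTG at the start of this gene is read as methionine. `translate_cds` is an illustrative name.

```python
from itertools import product

# Standard codon assignments in TCAG order (also used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons of table 11; at the first position they are read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "GTGACGTCGA TCGAGCGGAC CGCGTATCCG CGGTTCAAGC GGTTTCTGTC GGCCCGGGAA"
print(translate_cds(first_line))  # MTSIERTAYPRFKRFLSARE
```

The output matches the first two blocks of the protein sequence (MTSIERTAYP RFKRFLSARE). Passing the full gene sequence should yield the complete protein, ending at the terminal TGA stop codon.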