Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_3875 |
Symbol | |
ID | 5060353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 4435859 |
End bp | 4438927 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640476132 |
Product | ATPase domain-containing protein |
Protein accession | YP_001160683 |
Protein GI | 145596386 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.122967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCACCG GACCGACGAC CCTGCCCGCA AGCGGCGACA TTGACCAGCC GAAGCGGCGC TGGCTGCCTC GGCTCCGCAA TGCACGAATC CGCTCGAAGC TGGCGCTGAT CCTGGTCGTG CCGGTCACCG CGGTCATCGC ACTGGCGACT GTTCGTCTGA TCACAGTCGG CGAGGGCGCG TACGACGTCA CCCGGGCCAA AGCGCTCACC GAACTCTCCA TCGACATCTC CGCCCTCGCT CAGGACCTGC ATGCCGAACG AATGGCTGCT GCGGTCTATA TCGCGTCGAC GGAAGAGACG GAAGCGGCCG CCGACGCCTA CAACCTGCGG GTGCGCAGCA CTGACGAGCG GGTGCAGGCC TACCAGGAGG AGCGTGAGCT CTTCGACGAG CTGCCCGCTG CCGTCAGCGA CCGGGTGGCG GCCATTGACG GGCACCTCGA GACGCTGGAC GCCACTCGGC AGCAGGTGCT GGACCGCAGG CAGATGGCGG TCGCCGAGTC GGCGTTGCGC TACGGCGTCA TCCTGGCCGA CCTGGTGGCC TACGGCGACG GCCTCGCCCA GCTGCCCGGC GACGAGCGGC TGGCGGATGC CCGCCGGGCG GTCGCTGCCT TCGCCCGGGC CAAGGCAGCG GTCGGCGAGG AGGAGTCAGT CGCCTACACC GCGTTGAGCG CGGGCCGCTT CGACGAGGAG CAATACTCCT CCTTCGTGGC CACATTGACC AGTCAGCAGG AGGCGCTGCT CGCCTTCTCC CTGGCGGCGG ACCCACGGCA ACGCTCGCTC GTGGACAACA CCGTCTCGGG TGACTCGGTC ACCCTGGCCG ACACGGTTGC CGCGGACATC ACCCGCTCGG TCGGGCAGCA GTCCCTGGTG AGCGCCCAGG ACGCCAGCGC CGCCATCGGT GCTGTCCACG ACCTCATGCG GTGGACCGAG GCCCGGCTCC AGGAGCAGCT GCTCGCCGAT ACCGAGCAGG ATCGCTCGGA GGTCCTGCGC CAGGCCTTCA TCGAGTCGTT GCTGGTGCTG ATGACCCTGG TCATCGCGGT CACTCTCGCC GTGGTGCTGG CCCGTTCGCT GAACCATTCG CTGTACCGGC TGCGGGAGGG TGCCCTGGCC GTGGCCAACC ACGACCTGCC GGACGCGGTG CGTCGCCTGC AGAGCATGGA GGCGGTTGAC GAGGGCGGGG TCGACGACAT CGTCCGGGAG ATTCGCGACC CGATCCGACT CACCAACCAG GACGAGGTTG GTCAGGTCGC GGTCGCCTTC AACGTGGTCC ACCGGGAGGC GGTCCGGGTG GCGGCGGAGC AGGCCGCTCT GCGGACCAGT GTCTCGGCGA TGTTCCTCAA CCTGGCCCGG CGGAGTCAGA CGCTGGTTGA CCGCATGATC GGGGAGCTGG ACGCGATCGA GCGTGGTGAG GAGGACCCGA AGCGGCTGGC CCGACTCTTC GAACTCGACC ACCTGGCCAC CCGAATGCGC CGCAACGACG AGAACCTGCT GGTCCTTGCC GGGGCCGACT CGACCGTACC CCGGCGGGAA GATGCTCTGT TGGTGGACGT CCTGCGGGCG GCGCAGTCCG AGGTGGAGCT CTACAACCGG ATTGAGTTCG GCACCGTCGA CACGGACGTG TCGGTCGCCG CTCACGTGGT CAACGACGTG GTCCGGCTCG TTGCCGAACT ACTTGACAAC GCGACCCGGT TCTCGCCGCC GAACACCACC GTGGTCGCTG ATGGGCGGCG GATCCGCGAC TACGTGCTGA TCCAGGTCGA GGACCGTGGT CTCGGCCTCT CCGACGAGCA GCTCGACTCG CTCAACCGCC GGTTGGCCGA ATCGTCGAGC GTTGATGTCG CGGCGTTCCG GCTGATGGGC CTGGCCGTGG TGAGTCGACT CGCCGAGCGG CACGGCATCC GGGTCGAGCT ACGCCGCAAC GTTGAGGGCG GCATCGTCGC CCAGGTAACC CTGCCGACGG CCGCGGTTGT GCTGCCCGTC GGGCGGGGAC CAGCCCAGCT CACCCGACCT CGTCAGCCGC TTGCGGTGGA GCAGGGACCA TCCACTTCGG TCGGCATCGG CGACCCGGCG GCGGGCGCCA CGCGGGCCGC GACGTTGCCG GACCAGCGGC AGCCCGAGCC AGCCCCCTGG CAGGCACCGG TCCAGGCCGG CACCAGCGTG GCACCGATCC AGGCCGGCGG GGTTGCCGGG ACCGAGCCCA GCCTCAGCGC CGCAGGCCGT CCCGGTGATG CGCCCACGGC GGCGTACCCA CTGCCCCAAC GGAACCCCTC GGGCGAGTCG TCGGTGGCGA CTGCGGCCTT CCCCACCGTG GCCGCGACCC CTCCGCCGTC AGCCAACCCG CCGTCAGCCA ACCCACTTCC CGCCGACAGC GGGCTGACCG GTGGGCTCGG CGCGGACCTG GCCGCGACCG CGTCGATCGA CGCGACGCCC CTGTCTGCGC CGCCGGCGGA AGCACCGATC TTCCGGGAGA TGGAGGCGGT CTGGTTCCGG ACGCACGGCA ACGATGCGAC GGCCATCTTC ACCCGGCCGG ACTTCGACGG AACGCCCCCG GGCCCGTCTG CGACGCCGGG TTCCCCAGCC CAGCCGCACC TACCCACCCG GGTGCCCGGA ACGACGACGA CTCCGCCAGC TGCCACCCCC GTGCCGTCGT ACTCCGCCGC TCCCACGAGC CCAGCCGCTC CCCCTTCCGC TGGACCGCCG CCCACTGCTC CCCCTTCCGC CGGACCACCG CCCACCGCAC CCCCGACCAG CGTGCCGCCG GCCTCGCCGA GCGCTTCCGG GGCCCCAGCC GGCGGTGGCA ATGCGTGGCG CACCATCGCG GATGAGGGTT GGAGCCGGGC CAGCCAGGCT GCCGAGCCGG CCAGCGGCGG TACCACCCGT TCCGGTCTGC CGAAGCGGGT GCCGAAGGCG CAACTTGTCC CCGGCGGGAT CGAGCCCCAA ACCCGGGAAC GGACCCGCCG GACGCCGGAC AACGTTCGCG GCCTGCTGTC GGCCTATCAC CGTGGCGTGC AGCGCGGTCG TGCGGCCGGC TCGGACCCCA ACAGCACCTC GAGCAAGGAG ACGAGCTGA
|
Protein sequence | MSTGPTTLPA SGDIDQPKRR WLPRLRNARI RSKLALILVV PVTAVIALAT VRLITVGEGA YDVTRAKALT ELSIDISALA QDLHAERMAA AVYIASTEET EAAADAYNLR VRSTDERVQA YQEERELFDE LPAAVSDRVA AIDGHLETLD ATRQQVLDRR QMAVAESALR YGVILADLVA YGDGLAQLPG DERLADARRA VAAFARAKAA VGEEESVAYT ALSAGRFDEE QYSSFVATLT SQQEALLAFS LAADPRQRSL VDNTVSGDSV TLADTVAADI TRSVGQQSLV SAQDASAAIG AVHDLMRWTE ARLQEQLLAD TEQDRSEVLR QAFIESLLVL MTLVIAVTLA VVLARSLNHS LYRLREGALA VANHDLPDAV RRLQSMEAVD EGGVDDIVRE IRDPIRLTNQ DEVGQVAVAF NVVHREAVRV AAEQAALRTS VSAMFLNLAR RSQTLVDRMI GELDAIERGE EDPKRLARLF ELDHLATRMR RNDENLLVLA GADSTVPRRE DALLVDVLRA AQSEVELYNR IEFGTVDTDV SVAAHVVNDV VRLVAELLDN ATRFSPPNTT VVADGRRIRD YVLIQVEDRG LGLSDEQLDS LNRRLAESSS VDVAAFRLMG LAVVSRLAER HGIRVELRRN VEGGIVAQVT LPTAAVVLPV GRGPAQLTRP RQPLAVEQGP STSVGIGDPA AGATRAATLP DQRQPEPAPW QAPVQAGTSV APIQAGGVAG TEPSLSAAGR PGDAPTAAYP LPQRNPSGES SVATAAFPTV AATPPPSANP PSANPLPADS GLTGGLGADL AATASIDATP LSAPPAEAPI FREMEAVWFR THGNDATAIF TRPDFDGTPP GPSATPGSPA QPHLPTRVPG TTTTPPAATP VPSYSAAPTS PAAPPSAGPP PTAPPSAGPP PTAPPTSVPP ASPSASGAPA GGGNAWRTIA DEGWSRASQA AEPASGGTTR SGLPKRVPKA QLVPGGIEPQ TRERTRRTPD NVRGLLSAYH RGVQRGRAAG SDPNSTSSKE TS
|
| |