Gene Information |
Locus tag | Sare_3071 |
Symbol | |
ID | 5706842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3480124 |
End bp | 3481233 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641272512 |
Product | histidine kinase |
Protein accession | YP_001537880 |
Protein GI | 159038627 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
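The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a single untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3480124
end_bp = 3481233

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints "1110 369"
```

Under these assumptions the result agrees with the listed Gene Length (1110 bp) and Protein Length (369 aa).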
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000550678 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAACCTCC GGCGCGTCAG CCTCGCAGTG CGGCTCTTCG CCGCGCAGGT GCTCGCGCTC GCCGTCGGCG GGCTGACCCT GGCACTGGTC GCCGGCGCGG TGGGCCCACG GATCTTTCAC GAGCACCTCG GGCGAGTCGG CGGGGAGGTC AGCGCCGAGG CGCGCTGGCA CGTCGAGCAG GCGTACGCCT CGGCGAACCT GCTCGCGCTC GGTGTCGGTC TGCTCGCCGC GCTGGGCGCG GCGCTGGCGG CCAGCGCGTA CGCGACACGG CGGATCACCC GCCCGGTCAC TCACTTCGCG CACGCGTCGG CCAGCCTCGC CGACGGACAC TACGACGTCC GCATGGCCGA CCCCGGGCTC GGCAACGAGC TCAAGACCCT CGCGGACTCC TTCAACACCA TGGCGGAGGG CCTGGAGACC GTCGAAGCGA CCCGGCGACG GCTGCTCGCC GACCTCGGCC ACGAACTGCG TACCCCGCTC GCCACCATCG AGGCCTACCT CGAAGCGGCT GAGGACGGCG TCGCTGTCGA CGACGAAGAC CTCCAGTCGG TGCTGCGTGC CCAGACCGCG CGGCTGCACC GACTCGCCGA CGACATCGCC GCGGTCTCCC GCGCCGAGAC GCATCAGCTC GACCTGCACC CGGTACGGAC GGCCCCGGCG GACCTGGTCC GGGACGCCGT GGCCGCGGTG CGGCCCCGCT ACGCCGGCAA GGGTGTGACG CTGCGCAGCG ACCTTCGCCC GTCCCCGCAG GTCGATGTCG ACCCACAGCG GATGGGGCAG GTGCTTGGCA ACCTCCTGGA CAACGCGCTA CGGCACACCC CCGAGGGCGG GACCGTCACG GTACACGTGA TCGGCGGTGC CAGAAGTGTG GAGCTGGCGG TGGCCGACAC CGGTCCCGGC ATCCCGGCGC AGCACCTCCC GCACGTCTTC GAACGCTTCT ACCGGGTCGA CACGGCCCGG GACCGGGACA ACGGCGGGTC GGGGATCGGG CTCGCCATCG TGCGGGCCGT GGTCAGCGCC CACGGGGGGC GGGTACGGGC GGATAACGTC CCGGGTGGTG GCACGATGGT CAAGGTGGTC CTGCCACCGT CGGGACAGGG GCAGGTCTGA
Protein sequence | MNLRRVSLAV RLFAAQVLAL AVGGLTLALV AGAVGPRIFH EHLGRVGGEV SAEARWHVEQ AYASANLLAL GVGLLAALGA ALAASAYATR RITRPVTHFA HASASLADGH YDVRMADPGL GNELKTLADS FNTMAEGLET VEATRRRLLA DLGHELRTPL ATIEAYLEAA EDGVAVDDED LQSVLRAQTA RLHRLADDIA AVSRAETHQL DLHPVRTAPA DLVRDAVAAV RPRYAGKGVT LRSDLRPSPQ VDVDPQRMGQ VLGNLLDNAL RHTPEGGTVT VHVIGGARSV ELAVADTGPG IPAQHLPHVF ERFYRVDTAR DRDNGGSGIG LAIVRAVVSA HGGRVRADNV PGGGTMVKVV LPPSGQGQV
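The sequences above are printed in space-separated blocks of 10 residues, so the spaces have to be removed before any computation. The following is a minimal sketch of how the Gene sequence field could be cleaned, split into codons, and scored for GC content. The helper names (`clean_sequence`, `gc_fraction`, `split_codons`) are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than on the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace between the 10-residue blocks and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def split_codons(dna: str) -> list[str]:
    """Split a sequence into complete codons, dropping any trailing partial codon."""
    seq = clean_sequence(dna)
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two blocks of the Gene sequence field, copied from the record.
first_blocks = "GTGAACCTCC GGCGCGTCAG"

print(gc_fraction(first_blocks))   # 14 of 20 bases are G or C
print(split_codons(first_blocks))  # six complete codons, starting with GTG
```

Applied to the full Gene sequence field, `gc_fraction` could be compared with the listed GC content, and the codon list could be translated with translation table 11, in which the initial GTG is read as the start methionine.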