Gene Information |
Locus tag | TK90_1372 |
Symbol | |
ID | 8807138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 1465916 |
End bp | 1467226 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | response regulator receiver modulated diguanylate cyclase |
Protein accession | YP_003460614 |
Protein GI | 289208548 |
COG category | |
COG ID | |
TIGRFAM ID | |
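The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon; the function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1465916
END_BP = 1467226
GENE_LENGTH_BP = 1311
PROTEIN_LENGTH_AA = 436


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: the span from Start bp to End bp covers 1311 bases, and 1311 / 3 = 437 codons, one of which is the stop codon, leaving 436 residues.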
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.738866 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.941955 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGGATCC TGGTCGTCGA TCGCTCGCGC GTGTTCCGGG CACTGTGGAG CCGCATGGTG GGCACCGCCG GGCATGAAGC CAGCATGGCC GGCACCGGCG CCGAGGGGCT CGCACGTCTG GACATCTTCC GCGCAGAGCT GGCCTGCGTT TCGCTAAGCC TGCCCGACAT GGATGGCCTG GAATTCTGCC GCGCGGCGCG CAAGAGGCGT TACGGCCGCG GGCTCCCGCT GATCATCCTT ACGTCCAACC CTGATATGCG CATTCGCGAA CGCGCCTTCG CCGCGGGCGC GACCGATGTC CATGACCGAA CCAACATTGG GGAACTGTTC CAGCAGGCCG AACGCTACGC GCAGCCGCTG GAGAGCCTCC TCAGCGGACG GATTCTGTAT GTGGAGGACA GCCCCACTGC CGCACGCTAC CTGCGTCGCG CCCTGGAGCC CATGGGCCTC GATATCGTGC ACTACACCAG TGCCTCCGAT GCCCTCGCGG CCTTCGACCC CTCGCAGTTT GACCTGGTCC TGAGCGACAT CCTGGTTGAG GGAGAGCTGA GCGGAATGGG GCTTGTGGGT TGCATCCGGC AGCAATGCCC GGACATGACC GAGATGCCAA TCGTCGCCAT GTCGGGCCTG GATGACCAGC ATCGACGGGC GGAAATTTTT CGACTCGGTG TCAATGACTT CGTTACCAAA CCTTTCCAGG GAGATGAACT TCGGGCGCGC CTGTACAACC TGTTGAACAA CAAACGCCTG ATGGAAGAGG TCCGCCTTCA GCGCCAGAAG CACTACGAGA TGGCGATGGT TGATCCACTG ACCGGTCTGA GCAATCGCAA TGCCCTGACG GAGTTCGCCA GCCGATTCTT CGATGAACCC TGCAGCCACG ACCAACCGCT GGCCCTGATC CTGATGGACC TCGACCACTT CAAGGACATC AATGACCGTC ACGGCCACCT GGTCGGCGAC GAGGTCCTCG CGGCCATTGG CCAACTGCTG CGCGCGGTTG CCGCGCCCGG CGAGATCCCC GTCCGCTTCG GCGGCGAGGA GTTGCTGATG CTAATGCCAG GCAGCGACTT GTTCGATGCC CATACCCGTG CCGAGGACCT GCGCCAGCGG GTCCAGGATC TGCACCCGGC GGGCCTGCCG CTCACCGCGT CCTTCGGAGT CACCAGCCGA GAGCCCGGGA GTGATGCCGA ACTCAACGAC CTGTTCCGCG CAGCCGACGA GGCGGTCTAC AACGCCAAGG CCCAGGGGCG GAACTGCGTC GTCTCCCTAC CCGGAAAGCA TCACAACTCT GCCGCGGAAT ACTACGGCTG A
Protein sequence | MRILVVDRSR VFRALWSRMV GTAGHEASMA GTGAEGLARL DIFRAELACV SLSLPDMDGL EFCRAARKRR YGRGLPLIIL TSNPDMRIRE RAFAAGATDV HDRTNIGELF QQAERYAQPL ESLLSGRILY VEDSPTAARY LRRALEPMGL DIVHYTSASD ALAAFDPSQF DLVLSDILVE GELSGMGLVG CIRQQCPDMT EMPIVAMSGL DDQHRRAEIF RLGVNDFVTK PFQGDELRAR LYNLLNNKRL MEEVRLQRQK HYEMAMVDPL TGLSNRNALT EFASRFFDEP CSHDQPLALI LMDLDHFKDI NDRHGHLVGD EVLAAIGQLL RAVAAPGEIP VRFGGEELLM LMPGSDLFDA HTRAEDLRQR VQDLHPAGLP LTASFGVTSR EPGSDAELND LFRAADEAVY NAKAQGRNCV VSLPGKHHNS AAEYYG
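The GC content and Protein sequence fields can in principle be recomputed from the Gene sequence. The sketch below is one way to do that in plain Python; it is not the database's own method. It assumes the space-separated 10-base blocks are purely cosmetic, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in the assignments). The helper names are hypothetical, and only the first 30 bases of the record are used for the demonstration.

```python
from itertools import product

# Codon table built from the compact NCBI layout (base order T, C, A, G).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as table 1.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove whitespace used for display grouping and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base blocks of the Gene sequence field.
    prefix = clean_sequence("ATGAGGATCC TGGTCGTCGA TCGCTCGCGC")
    print(translate(prefix))  # MRILVVDRSR
    print(round(gc_content(prefix), 3))
```

The translated prefix matches the first ten residues of the Protein sequence field. Passing the full Gene sequence string through the same functions would allow a comparison with the reported 64% GC content and 436-residue protein.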