Gene Information |
Locus tag | TK90_0370 |
Symbol | |
ID | 8806105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 390955 |
End bp | 392448 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | RNA polymerase, sigma 54 subunit, RpoN |
Protein accession | YP_003459621 |
Protein GI | 289207555 |
COG category | |
COG ID | |
TIGRFAM ID | |
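The coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record itself. The helper names `span_length` and `coding_codons` are hypothetical and are used only for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 390955
end_bp = 392448
gene_length_bp = 1494
protein_length_aa = 497


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent
# under the assumptions stated above.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under these assumptions the listed 1494 bp and 497 aa agree with the coordinates, and the final `TGA` triplet in the gene sequence fits the single-stop-codon assumption.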
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.273387 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAGC CCGGGCTTAA TCTGCAGCTG GGGCAGACGC TGACGATGAC CCCGCAGCTG CAACAGGCGA TCCGTCTGCT GCAGCTTTCC ACGCTGGAGT TGCAGGCGGA GATCCAGCAG GCGCTGGAAA GCAACCCCAT GCTCGAGCAG GTGGAGGACG AGGGCGAGGA GGCCCCGCCC GAGCGCCCGG AGGAAGGGGG CGAACGGGTC GACGGGGATC GCATGCAGGC CGACACGGCT GGCGATGACG TGATCCCCGA GGAGTTGAGC ACCGACAGTC GCTGGGACGA CATCTACGAG CCCGCCCCGA GCCCACGCGG CCTGGACGAC CGCGACCCGC TGGAGAATAC CTCCGACGGC GAGGACGAGG GCCTGCAGGA CCACCTGCTC TGGCAGCTGC ATCTGAGCCA CCTGACCCCA CGAGACCAAC GGATCGGGGC GGCGCTGATC GACGCGATCA GCCCGGACGG CTATCTGGAC AGTGAACTCG AGGCGCTGGC GGAGATGATC AGCCAGGGCG AAGACGAGCC CGTGGGGGTC GACGAGGTCG AGGCGGTGCT GCACTGGATT CAGCAGTGCG ACCCGCTGGG CGTCGGTGCG CGGGACCTGC GCGAGTGCCT GGAGATCCAG CTGCGCCATT TGGCGCCGGA TGCCCCGGCG CGTGACGCGG CTCGAGCCAT CCTGGAAAAC GGCATGGAGA GCCTGGCCAA GCGGGACTAC CGAGCCCTGC AGCGCCAGTC CGGCGTGCGC GACCCGGACG AGCTGGCGCG GGCGCTGGAC CTGATCCGCA CCCTGAATCC GCGGCCCGGT GCCCAGATCA GCGAGTCCAC TTCGGAATAT GTGGTGCCGG ATGTCTATGT CCGCCATGAC CGCGATGGCT GGCGCGTGGA CTTGAACCCG GAGATTGCGC CGCGCATTCG CATCAACGAT CTCTATGCGG GCATGGTGCG GCAGGTCTCC GATGCGCGTG ATAGCGCGTT TATGCGCGAT CAGTTGCAGG AGGCGCGCTG GTTCCTGAAG AGCCTGCACA GCCGGAACGA CACCCTGCTG CGGGTCGCCC AGGCCATCGT CGTGCGCCAG CAGGGGTTTC TCGAGCATGG TGAGGTTGCC ATGCAACCGC TGATCCTGCG CGATATCGCC GAGACCCTGG AGATGCACGA ATCGACCATC TCGAGGGTGA CTACACAGAA ATACATGCAC ACCCCGCGTG GGGTATTCGA GTTCAAGTAC TTCTTTTCCA GTCATGTTGG CACAGCCGAT GGCGGCGAAT GCTCGGCGAC CGCGATCCGT GCCATGATTC GGGAATTGAT TGGTGGCGAG ACCCCGAACA AGCCACTGAG TGATGCAAAA CTGGCCCAGA TCCTGTCGGA TCGGGGCATT AATGTGGCCC GACGCACCGT GGCCAAGTAC CGCGAGGCTA TGCACATACC GTCTTCCAGC GAACGCCGGC AGATGGCGAG TATGCCCAGT GGAACCAAGC AAAAAGGAGC CTGA
|
Protein sequence | MLKPGLNLQL GQTLTMTPQL QQAIRLLQLS TLELQAEIQQ ALESNPMLEQ VEDEGEEAPP ERPEEGGERV DGDRMQADTA GDDVIPEELS TDSRWDDIYE PAPSPRGLDD RDPLENTSDG EDEGLQDHLL WQLHLSHLTP RDQRIGAALI DAISPDGYLD SELEALAEMI SQGEDEPVGV DEVEAVLHWI QQCDPLGVGA RDLRECLEIQ LRHLAPDAPA RDAARAILEN GMESLAKRDY RALQRQSGVR DPDELARALD LIRTLNPRPG AQISESTSEY VVPDVYVRHD RDGWRVDLNP EIAPRIRIND LYAGMVRQVS DARDSAFMRD QLQEARWFLK SLHSRNDTLL RVAQAIVVRQ QGFLEHGEVA MQPLILRDIA ETLEMHESTI SRVTTQKYMH TPRGVFEFKY FFSSHVGTAD GGECSATAIR AMIRELIGGE TPNKPLSDAK LAQILSDRGI NVARRTVAKY REAMHIPSSS ERRQMASMPS GTKQKGA
|