Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0039 |
Symbol | |
ID | 7316689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 39215 |
End bp | 41182 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643614929 |
Product | histidine kinase |
Protein accession | YP_002512130 |
Protein GI | 220933231 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCGAA TTCCGGCCGC CCCGAACACG CCCTTGCGCC TTCGCACCAA CATCTTCCTG TGGGTCTCCC TGGCCACCGT GCTGCCGCTC ACGGTGCTGG TGCTGGTGGT CACCGCCTAC AGCGAGCGCC AGTACAGCCA GGACCTGGAC CGCCAGGTGG GTCAGAGCCT CAATACCCTG GTGGCGGAGA TGGACCGGCG CCTGCGCTTC GAGCGGGAGG TGATCGGCGC GCTGGTGAAC TCGGCGCCCA TGCAGGGCTT CCAGTCGGTG CTGGAGCAGG CCCGGGACGG GCGCAAGCAC CCCCAGTTCA ACCTGCGCGC CGCGCAGCTC GGCATCTTCC TGGAAGAGTT CACCGCCCTG ATCCCGGACA TCGGCACCAT CCGCGTGCTG GACACCCGGG GCAACACCCT GATCATGATC CGCGAGCGCC AGGTCATGCC CACCCACTAC CCGGGCATGG ACCCCTACCC CTACGCCGAG GAGGAGCTGG ATGATCCAGC CTTCCTGCAG CGCCTGCGCG CCCTGCCCCG GGGCGAGGTG AGCTACCTGC TGCTGCCTGA GAGCCGCAGC GACTACATCT TCGGCGCCAT CCCGCCCATG TACGACGCCG CCGTGCCCAT CGAACGGGAG GGCGAGGGGG TGGTCGGCTA CCTGCTGGCC AACTCTACCG GCCTGCAGAT CGACCGCATG CTGGAACTGG CCCCGCGCCT CAACGAGGGA CGCCTGAGCC TCGCGGAACT CAATCCCGAT TATCCTTACC GGGACGGCCT GATCCTGTTC GACGAGGCCT CCGGCCTGCT GTTCACCAGC GCCAAGCATC CGGAGATGCG CGTGGCCAAC ACCTTCTGCG AAGGCCTGGC CCGGCAGGTG AACGTGCAAC CCTTCGGCGC CGCCAACCTG CAGACCCGCC CATCGCGGGT GTTCTTCGCC GAGTACCACC CCTACCCGAA CCAGCTGGTC AGCTGGGTGG TGATCTCCGA GATCCCCCAT GACGCCATCG GCGCGCCCTT CAAGCGCATC CGCCAGGGCA TCCTGCTGAT GGCTGGCCTG GCCCTGCTGG TCAGTCTGCT ACTGGCGCAG CTCAGTGCCC GGCGCATCGC CAAGCCCATT ACCCGGCTCA CCCACAACCT CAAGGCCTAC GCCCTGGGTG AACCCCTGGA GCAGAACGAG CCCCACAACA CCCAGGAGAT CCGCGAGCTG CAGGACTCCT TCACCTACAT GGCCGAGACC CTGGAGAAGG CCCGCACCGA CCGCGACCAG GCCCAGCGCA TGCTGCTGCA GTCCGCCAAA CTCGCCTCCA TCGGCGAGAT GGCCGCCGGC ATCGGCCACG AGCTCAACAA CCCCCTGAAC AACATCCTGT CCCTGGCCAA GCTGATCCGC CGGGAACTGC CCGATGACGA CCCGCGCGCC CGGGAAGACC TGCGCGCGCT CACAGACGAG GCGGAGCGGG CCACGCGCAT CGTCAACGGC ATCCTCAACT TTGCCCGCCA GGTGCCGCCC CACTACGTGC CCATCGACGT GCGCCCGTGG CTGGAGGAGA CCCTGATCCT GGTCAACCAG TCCGCCCGGG ACCAGCAGGT CAGCGTGCGC CTGGAGGTGG AGGAGGGCCT GGTCATGGAA GGTGACGTGA ACCAGCTCCG GCAGGTGCTC ATCAACCTGC TGCTCAACGC CATCCAGGCG AGCCAGGCCG GCGACGAGGT GGTGATCATG GCCCGCCACG ACCCGGACGG GGAGGTCAAC GTGTGCGTCT GCGACCAGGG CTGCGGCATC AAGCCGGAGG TCCAGGACCG TATGTTCGAC CCCTTCTTCA CCACCAAGCC CGTGGGCAAG GGCAGTGGCC TGGGCCTGTC CATCAGCCTC GGCATCGTCG AGCAGCACGG CGGGAATCTG GACATCCGTC CCAACGAGCG TGGCGGCGTC ACCGCCACGG TGCGCCTGCC GGCTGCCCGC CGTCAGGCCG CGCCATGA
|
Protein sequence | MARIPAAPNT PLRLRTNIFL WVSLATVLPL TVLVLVVTAY SERQYSQDLD RQVGQSLNTL VAEMDRRLRF EREVIGALVN SAPMQGFQSV LEQARDGRKH PQFNLRAAQL GIFLEEFTAL IPDIGTIRVL DTRGNTLIMI RERQVMPTHY PGMDPYPYAE EELDDPAFLQ RLRALPRGEV SYLLLPESRS DYIFGAIPPM YDAAVPIERE GEGVVGYLLA NSTGLQIDRM LELAPRLNEG RLSLAELNPD YPYRDGLILF DEASGLLFTS AKHPEMRVAN TFCEGLARQV NVQPFGAANL QTRPSRVFFA EYHPYPNQLV SWVVISEIPH DAIGAPFKRI RQGILLMAGL ALLVSLLLAQ LSARRIAKPI TRLTHNLKAY ALGEPLEQNE PHNTQEIREL QDSFTYMAET LEKARTDRDQ AQRMLLQSAK LASIGEMAAG IGHELNNPLN NILSLAKLIR RELPDDDPRA REDLRALTDE AERATRIVNG ILNFARQVPP HYVPIDVRPW LEETLILVNQ SARDQQVSVR LEVEEGLVME GDVNQLRQVL INLLLNAIQA SQAGDEVVIM ARHDPDGEVN VCVCDQGCGI KPEVQDRMFD PFFTTKPVGK GSGLGLSISL GIVEQHGGNL DIRPNERGGV TATVRLPAAR RQAAP
|
| |