Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1029 |
Symbol | |
ID | 7316606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1116305 |
End bp | 1119265 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643615915 |
Product | two-component sensor CbrA |
Protein accession | YP_002513103 |
Protein GI | 220934204 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0591] Na+/proline symporter [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.211087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCTG AACTGGGGAT CCTCTATGCG GTAGGCGTGG CGTACCTGGC GTTGCTGTTC TTCGTGGCAC ATGCCACGGA GAAAGGCTGG CTGCCGGACG CCCTGGTGCG TCACCCCCTG GTCTACGCCC TGTCCCTGGG CGTCTACGCC ACCTCCTGGA GCTTCTACGG CAGCGTCGGC TTCGCCGGCG AGCAGGGCTA CAACTTCCTC ACCATCTACC TGGGGGTGAC CCTGGCCTTC GCCCTGGCCC CGGTGCTGCT GGCGCCCATC CTGCGCCTGG TGCGGGACTA CCAGCTCACC TCCCTGGCGG ACCTGTTCGC CTTCCGCTAC CGGCACCAGT TCACCGGCGT GCTGGTGACC CTGTTCATGC TCGCCGGGGT GCTGCCCTAC ATCGCCCTGC AGATCAAGGC GGTGACCGAA TCCGTGGCGG TGCTCACCCA GGAGGCGCCG CCCCACCTGC TGGCGCTGGG CTTCTGCGCC ACCCTGAGCC TGTTCGCCAT CCTGTTCGGG GCACGGCACA TCACCCCCCG GGAGAAGCAC GAGGGCCTGG TGATGGCCAT CGCCTTCGAG TCGGCGGTCA AGCTCATTGC CCTGCTCATG GTGGGCGTGT TCGCCCTGTT CGGCATCCTG GGCGGCCCCG CGGCCCTGGG GGAATGGCTC CTGGAACACC CGGAGGCCAT CGAGGCCCTG TACGCGCCGG TGCGTGAGGG GCCGTGGATG ACCTTCATGC TGCTGGCCTT CGCCGCCGCC TTCCTGCTGC CGCGCCAGTT CCACATGATC TTCGTGGAGA ACATGGAACC GGGGGCCCTG CGCGTGGCGG CCTGGGCCTT CCCCCTGTTT CTGTTGCTGC TCAACCTGCC CATCCCCCTG ATCCTGTGGG CCGGTCAGCA GGCCCAGGTG GGCACGGTAC CGGACTACTA CGTGCTCGGC CTGTCGGTGG AGCACGGCTA CGGCTGGGCA CTGTTCACCT TCATCGGCGG CGTCTCGGCG GCCAGCGCCA TGATGATCGT CACGACGCTG GCCCTGGCGG CCATGTGCCT CAATCACCTG ATCCTGCCTG CCCGCTACCA GAACCAGGCC CTGCCCAGGC AGAATCTCTA CACCTGGCTG CTGTGGGGCC GTCGCATCCT GATCGTGCTC ATCATCGCCC TGGGCTACGT GTTCTACCGG CTGATGGAAT TCCACCAGAG CCTGGCCCAG ATCGGCCTCA TTTCCTTCGT CGCCGTGGCC CAGTTCCTGC CCGGCCTCAT GGGCCTGCTG TTCTGGCCCC GAGCCACCCG CTGGGGCTTC ATCGCCGGGC TCGCCGGCGG CATCACCGTC TGGGCCCTGG CCCTGATCGT GCCCCTGCTG CAGCAGTCGG GACTCATGCC CGGGCAGCTG GAACTGCAGA CCTGGTTCCG CGCCACGGAA CAGAACCACT GGGCCTTCGC CACCTTCTGG TCCCTGGCGC TGAACGCGGT GCTGTTCGTG GCCGGCTCCC TGCTCACCCG GCCCAGCCCC GAGGAGACCG AGGCGGCACT CGCCTGCCGC CGCGAGCAGA TGGAACCGCC CCGGGGCCAG CTGGAGGTGC GCAGCGTGCA GCACTTCGAG GAACAGCTGG CCCAGACCGT GGGTCCACAG ACCGCCGCCC TGGAGGTGAG CCGCGCCCTG GCGGACCTGG GGCTGAAATC CGACGAGACC CACCCCGGTG AACTGCGTCG CCTGCGTGAA CGCCTGGAGC GCAACCTCTC CGGCCTCATG GGTCCGCTGC TCTCGCGCAT GATCGTGGAC CACCGCCTGC GGGTGGACCC CCACACCCAG ATGGCCCTGG CCGACAGCCT GCGTTTCGTG GAGGAGCAGC TGGAGCACTC CAACGTGCGC CTGCAGGGCC TGGCGGCGGA GCTGGACACC CTGCGCCGCT ACCACCGCCA GGTGCTGCAC GAACTGCCCC TGGGGGTCTG CTCCCTGACC CCCGGCGGCG AGATCATCAT CTGGAACAGC GCCATGGGAC TGATCTCCGG CATCCCCAGC CGCGACGCCG TGGGCCAGGA GCTGCGCCTG CTGGGCGATC CCTGGCACGG CATCCTGCGC GGCTTCCTGC AGGGCGACGA CAAGCACCTC TACAAGCTGC AGATCAACCT CCAGGGCCGC AGCCGCTGGA TCAACCTGCA CAAGTCGGAG ATCGAGGCCC CCGGTGCCCT CAATCCCGGC GCCGTGGTGG GCGGCACCGT GATCCTGGTG GAGGACCTCA CCGAACTGCA CACCCTGGAG AGCGAAGTGG CCCACAACAA CCGCCTGGCC TCCATCGGCC GGCTGGCCGC CGGGGTGGCC CACGAGATCG GCAATCCGCT CACCGGCATC GCCTCCCTCG CCCAGAACCT GCACTACGAG CCCGACGAGT CGGAACGTGA GCTCAGCGCC CAGCAGATCC TCGAGCAGAC CCGGCGCATC AACAACATCG TGCAGAGCCT GATCACCTTC AGCCACACCG GCGAGGTGCC CAGCACCGCC ATGGGCGCGG TGGACATCAA GATCTGCGTG GACGAGGCCA TGCAGCTGGT GCAGCTGAGC GACACCGCCC GGGCGCACCT GTTCGAGAAC CGCCTGCCCC CGGGAATCAC CGTGCGCGGC AGCCACCAGA AGCTGCTGCA GGTGTTCGTC AACCTGATCA ACAACGCCAT GCAGGCCTCG GGACCCGGCG ACCGTATCCG CATCCAGGGT GAGGCCACGG CAGGCTTCGC CGAGATCCTG GTGGAGGACG AAGGCACCGG CATCCCCGAG ACCCTGCTCG ACCGGGTCTT CGAACCCTTC TTCACCACCA AGCCGCCGGG CGAGGGCACC GGGCTCGGCC TGTCCGTGGT CTACAGCATC ATCCAGGACC ACGGCGGCGC CGTGTACGTG GACAACCTGC CCCAGGGCGG CGCCTGCTTC ACCCTGCGCC TGCCCCTGAG CACGACGCCC AGGCTGCGCG CCAGCGGCTG A
|
Protein sequence | MNAELGILYA VGVAYLALLF FVAHATEKGW LPDALVRHPL VYALSLGVYA TSWSFYGSVG FAGEQGYNFL TIYLGVTLAF ALAPVLLAPI LRLVRDYQLT SLADLFAFRY RHQFTGVLVT LFMLAGVLPY IALQIKAVTE SVAVLTQEAP PHLLALGFCA TLSLFAILFG ARHITPREKH EGLVMAIAFE SAVKLIALLM VGVFALFGIL GGPAALGEWL LEHPEAIEAL YAPVREGPWM TFMLLAFAAA FLLPRQFHMI FVENMEPGAL RVAAWAFPLF LLLLNLPIPL ILWAGQQAQV GTVPDYYVLG LSVEHGYGWA LFTFIGGVSA ASAMMIVTTL ALAAMCLNHL ILPARYQNQA LPRQNLYTWL LWGRRILIVL IIALGYVFYR LMEFHQSLAQ IGLISFVAVA QFLPGLMGLL FWPRATRWGF IAGLAGGITV WALALIVPLL QQSGLMPGQL ELQTWFRATE QNHWAFATFW SLALNAVLFV AGSLLTRPSP EETEAALACR REQMEPPRGQ LEVRSVQHFE EQLAQTVGPQ TAALEVSRAL ADLGLKSDET HPGELRRLRE RLERNLSGLM GPLLSRMIVD HRLRVDPHTQ MALADSLRFV EEQLEHSNVR LQGLAAELDT LRRYHRQVLH ELPLGVCSLT PGGEIIIWNS AMGLISGIPS RDAVGQELRL LGDPWHGILR GFLQGDDKHL YKLQINLQGR SRWINLHKSE IEAPGALNPG AVVGGTVILV EDLTELHTLE SEVAHNNRLA SIGRLAAGVA HEIGNPLTGI ASLAQNLHYE PDESERELSA QQILEQTRRI NNIVQSLITF SHTGEVPSTA MGAVDIKICV DEAMQLVQLS DTARAHLFEN RLPPGITVRG SHQKLLQVFV NLINNAMQAS GPGDRIRIQG EATAGFAEIL VEDEGTGIPE TLLDRVFEPF FTTKPPGEGT GLGLSVVYSI IQDHGGAVYV DNLPQGGACF TLRLPLSTTP RLRASG
|
| |