Gene Information |
Locus tag | Acid345_0843 |
Symbol | |
ID | 4070976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1046364 |
End bp | 1048505 |
Gene Length | 2142 bp |
Protein Length | 713 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982852 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_589922 |
Protein GI | 94967874 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
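The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends with one stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 1046364
end_bp = 1048505
gene_length_bp = 2142
protein_length_aa = 713

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length, computed_protein_length)  # prints "2142 713"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.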
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACAGA AAGTCGAACC CTCCGCCGAA GAAATCGGTC GTCTCCAGCG GTGTATCAGC GACCTCATTG GTGTCGTTGC ACTTCCCACC ATGTGGAGCG GCGGCGATCC TGAACAAGTC GCTCGTACGC TTCTCGAAAC GCTGGAAGGC GTGCTCCAGT TGGACTTTGC TTTCCTTCGC CTTACGAATC CTCCGTCAGA GCTGCTGCGT ACAAGCGCCC TGACGCCGGA GCAGGTTCGT GCGATCGTCG CCCGATCTCT GGACGACACC CCCAATGCAG TGTGTCGTGC CGAGCTTAGT TCGGGACGCG ACATTTCGGC GATCTCCGTG CATCTCGGCT TGCACGGCGA ATTCGGCGTA CTCATCGTCG GGTCCCAACG CTTCGAGTTC CCCCAGCAAA CCGAACGACT GTTGCTGAAT GTGGCCGCGA ACCAGGCAGC CATCGGTTTG CAAGAGGCAC GCCGCTTCGG AGAGCAAAAG CGCATCGCTC GTGAACTTGA CGACAAGGTA GCGCAGCGCA CCCGAGAACT CGCTGACGCG AACGAGGCCC TTCGCCACGA AATCGAAGAT CGTAAGGAGA TTGAAGCGCG CCTCCTCGAG AGTAAAGAAC AGCAATACCA CGTCCGAGTG GAGCTGCAAA AAGCGCTCGA TGAAATCCGC AAGTCCGAAG CGAAGTTGCA CCAGGTCATT GACACCATCC CCACCCTCGC CTGGTGCAAC CTGCCCGACG GGCCCAATGA GTTCCTCAGC AAACGATGGC ACGAGTACAC CGGACTCTCG CCGGAAGAAT CGCATGGCTG GGGTTGGCAA ACCGCGTTTC ATCCTGAAGA TCTGCCGGCG CTGATGAAAA AGTGGATGGA ACTGATCGAG ACCGGAGAAC CGGACGAAAT CGAATCGCGC CTCCGCCGTT ACGACGGCGT TTATCGATGG TTCCTCATCC GCGTCGAACC CTTCCGAGAT GAGACCGGAA CCATCGTGCG CTGGTACGGC ACCAGCACCG ATATCGAAGA ACGCAAGCAG GCAGAAGAAC GATCGCGCCG CAGCGAAGCA TTCCTCGCTG AGGGTCTGAA CCTTGCGCGT GTAGGAAACT TTTCCTGGCT CGTGGAAACC GACGACATCA AGTGGTCCGA CCAGCTCTAC CAGATTTTCG AATTTGAGCC CGGCCAGCCG ATAACCTTCG AGAAAATTGG CTCTCGCGTA CATCCCGACG ACGTGCACAC GTTGTACAAC ATGATCGAAA AGGCCCAGCG TAACGTGAGC GACTTTGAAT ATGAACATCG CTTGCTCATG CCGGATGGAA GCGTGAAGTA TTTGCGCCTG GTAGCTCACG CCGGCCGCAA CTCCGAACAC CAAGTCGAGT ACATCGGTGC GGTTCAGGAT GTAACCCAGC GCCATCTCGC TGATGATGCC TTGGCGCGCG CGCGCTCGCA GTTGGCAAAC GTGTCGCGGG TCACCAGTCT CGGCGTCCTG ACTGCGTCCA TCGCGCACGA GGTCAATCAG CCACTTTCGG GCATCATCAC CAACGCCAGT ACTTGCCTCA GGATGCTTTC CGCGGAACCG CCGAATGTCG AAGGCGCTCG CGAAACCGCA CTGCGCACCA TTCGCGACGG CAATCGCGCC GCCGACGTCA TCTCCCGATT GCGCACACTG TTCACGAGAA AGGACCGGTC CGCCGAGGCC GTCGATCTCA ACGACGCGAC CAAAGAGGTG ATCGCACTTG CTTTGAATGA ATTGCACCGC GGAAAGGTGG TCTTGCGGCC GGAACTCGGA GATGACCTTC CACCCGTCAT CGGTGATCGC 
GTCCAACTCC AGCAGGTGAT CATGAACCTC ATGCGCAATG CCTCCGACGC GATGAGCACC ATCCACGATC GTCCTCGGGA TCTATTGATC CGCACCGAGT CGGACGGCGA GGCCGTACGC TTGAGCGTCA CGGATTCCGG CGTAGGCTTC GATGCACAAT CCGCCGACCG GCTCTTCGAG GCCTTCTATA CAACCAAAAA CGATGGTATG GGGATCGGCC TCTCCATCAG CCGCTCCATC ATCGAGGCCC ACCAGGGGCG GCTCTGGGCA ACACCGAACC AGGGCCCCGG CGCCACCTTC TGCTTTTCGC TTCCGTGCAG CACTGATACC AAAGTCCAAT AG
|
Protein sequence | MSQKVEPSAE EIGRLQRCIS DLIGVVALPT MWSGGDPEQV ARTLLETLEG VLQLDFAFLR LTNPPSELLR TSALTPEQVR AIVARSLDDT PNAVCRAELS SGRDISAISV HLGLHGEFGV LIVGSQRFEF PQQTERLLLN VAANQAAIGL QEARRFGEQK RIARELDDKV AQRTRELADA NEALRHEIED RKEIEARLLE SKEQQYHVRV ELQKALDEIR KSEAKLHQVI DTIPTLAWCN LPDGPNEFLS KRWHEYTGLS PEESHGWGWQ TAFHPEDLPA LMKKWMELIE TGEPDEIESR LRRYDGVYRW FLIRVEPFRD ETGTIVRWYG TSTDIEERKQ AEERSRRSEA FLAEGLNLAR VGNFSWLVET DDIKWSDQLY QIFEFEPGQP ITFEKIGSRV HPDDVHTLYN MIEKAQRNVS DFEYEHRLLM PDGSVKYLRL VAHAGRNSEH QVEYIGAVQD VTQRHLADDA LARARSQLAN VSRVTSLGVL TASIAHEVNQ PLSGIITNAS TCLRMLSAEP PNVEGARETA LRTIRDGNRA ADVISRLRTL FTRKDRSAEA VDLNDATKEV IALALNELHR GKVVLRPELG DDLPPVIGDR VQLQQVIMNL MRNASDAMST IHDRPRDLLI RTESDGEAVR LSVTDSGVGF DAQSADRLFE AFYTTKNDGM GIGLSISRSI IEAHQGRLWA TPNQGPGATF CFSLPCSTDT KVQ
|
| |
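The gene and protein sequences above can be spot-checked against each other by translating the ends of the gene sequence. The sketch below is an illustration, not part of the original record. It uses only the first 30 and last 21 nucleotides copied from the Gene sequence field, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code. The helper name `translate` is hypothetical.

```python
# Codon table in the usual NCBI layout (base order T, C, A, G).
# Translation table 11 has the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 21 nt of the Gene sequence field (10-base groups kept).
gene_start = "ATGTCACAGA AAGTCGAACC CTCCGCCGAA"
gene_end = "ACTGATACC AAAGTCCAAT AG"

print(translate(gene_start))  # prints "MSQKVEPSAE"
print(translate(gene_end))    # prints "TDTKVQ*"
```

The first result matches the opening ten residues of the Protein sequence field, and the second matches its final six residues followed by the stop codon.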