Gene Information |
Locus tag | Acid345_0692 |
Symbol | |
ID | 4071337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 852741 |
End bp | 854603 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637982698 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_589771 |
Protein GI | 94967723 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
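
The length fields above follow from the coordinates by simple arithmetic. The sketch below reproduces them, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(852741, 854603)
print(length)                     # 1863
print(protein_length_aa(length))  # 620
```

Both results match the Gene Length and Protein Length fields of the record.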
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCAGA ATTTCAAGCC TTTAAATCCG GAAGAGAGCG TCGTGCGCCT CGCCGCCATC GTGGACTCGT CCGATGACGC CATTCTTGGT ACCGACACCC AAGGCACCAT CAACACCTGG AACCCTGCTG CCGAGAAAAT GTTCGGCTAC TCCGCCGACG AAATCCTCGG CCAATCAGTC CTGCTGCTCA TTCCCCCTTC ACTGCATGCC GAGCACGCAG AGAACTTCGA GCGCATTGCC CGGGGCGAGC GTATCGAGCA CTACGAAACC CAACGTTTGC AGCGAGGTGG CGGACGCATT GATGTATCGT TTACGCTTTC CGGCATTCGA GACCGCGAAG GCCGCATGCT CGGCGCGGCC ATGATTGCTC GGGAACTGAG CGCAAAGCGG CGGGACGAGG CGGTCCGTGC CCGGCTCGCC GCGATTGTCG AGTCCTCGGA CGACGCCATC ATCGCCAAGG ACCTCAACGG TGTCGTCACG GACTGGAATG CCGCCGCCGA GCGCCTCTTC GGATATAAAG CCGAAGACAT CATCGGACGC TCCATTCTCG CCATTATTCC CCCCGAACTT CAGCACGAAG AGCCGGTGAT TCTTTCCAAA ATCCGCGCCG GGCACCGCAT GGAACATTAT GAGACCCATC GCCTTCATAA ATCAGGGCGC CGCTTGGAAG TCTCAGTCAC AATTTCCCCG ATCCGCGACT CGTCTGGCCG CGTGATTGGC GCCTCGAAGT TCGCTCGCGA TATTTCCGAA AAGCGCCGCC TCCAGACCGT TCGGAGCATT CTCGCCGCGA TCGTCGAGTC TTCGGACGAC GCCATCGTCT CGAAGAACCT CGACGGCGTC ATCACCAGTT GGAACGCCGC CGCGGAGCGC TTGTTCGGTT ACACCGCTGA AGAGATCATC GGACAGTCCG TATTACGTAT CATTCCGCGC GAACTTCAGC ACGAGGAACC CGGTATTATT ACCCGCCTGC GCGCCGGAGA GCGGATCGAT CACTACGAGA CTCGGCGGCG GAAGAAGAAT GGCGAGAGCA TCGACGTCTC CTTGACCGTC TCCCCCATTC GCGATGAACG CGGCACCGTG ATTGGCGGGT CCAAGATTCT GCGTGACATC AGCGACCGCA AAATCGCCGA GGCCGCAATC ATCGAGAAGG AACGCTTTGC TGCCGCCGGA CGCCTCGCCG CGACCCTCGC GCATGAAGTC AACAATCCAC TCGAGGCCAT CACCAACCTC TCGTACCTGC TCTCCATCCA TGAAGGTCTC GATTCCGAGG CTGCTAATCT CGCCGCTTTG CTTTTGAAAG AGGTCCAGCG CGCGGGAGAG ATCACCCGGC AGACCCTGGT GTATTACCGC GAGTCCAAGG TGCCGTTGCT CGTCAGTCTT CGCGAAGTCG TCGCCAGCGT TCTCCGCGCC AAGCGCTCGA AGCTCGAGCT AAAGAACGTC CACGTCGATA GCGCCTTACC TGAACCTTTT TTCGTCGAAG GCTATCCGGG TGAGTTGCGC CAGGTGCTCG AGAATCTGCT CGACAACGCT CTCGACGCCG TGCCCGACGG CGGGCATCTC CAGATCCAGG GCAGCCGCTC CGTCTCCGCC GCCAACGAGC GCGTCCTCCT CTCGATTTGC GATAACGGCC CCGGCATTCC CGCCGAACTG GCCGGGAAAA TCTTCGAACC GTTTTTCACT ACCAAGAAAG AGAAGGGTAG TGGCCTCGGG CTCTGGGTCT CGCAGTCCAT CGTCAAGAAA CATCAGGGCA CGATCGAAGT TCGCAGCAAC CAGGAAAACC GCGAGACGGT TTTTACCCTC 
AATCTCCCTG CTGCCAGGCT TCCCGAGCCC GCGGACCGCC GCTCTCCGGC TGCAGTTTCC TGA
|
Protein sequence | MSQNFKPLNP EESVVRLAAI VDSSDDAILG TDTQGTINTW NPAAEKMFGY SADEILGQSV LLLIPPSLHA EHAENFERIA RGERIEHYET QRLQRGGGRI DVSFTLSGIR DREGRMLGAA MIARELSAKR RDEAVRARLA AIVESSDDAI IAKDLNGVVT DWNAAAERLF GYKAEDIIGR SILAIIPPEL QHEEPVILSK IRAGHRMEHY ETHRLHKSGR RLEVSVTISP IRDSSGRVIG ASKFARDISE KRRLQTVRSI LAAIVESSDD AIVSKNLDGV ITSWNAAAER LFGYTAEEII GQSVLRIIPR ELQHEEPGII TRLRAGERID HYETRRRKKN GESIDVSLTV SPIRDERGTV IGGSKILRDI SDRKIAEAAI IEKERFAAAG RLAATLAHEV NNPLEAITNL SYLLSIHEGL DSEAANLAAL LLKEVQRAGE ITRQTLVYYR ESKVPLLVSL REVVASVLRA KRSKLELKNV HVDSALPEPF FVEGYPGELR QVLENLLDNA LDAVPDGGHL QIQGSRSVSA ANERVLLSIC DNGPGIPAEL AGKIFEPFFT TKKEKGSGLG LWVSQSIVKK HQGTIEVRSN QENRETVFTL NLPAARLPEP ADRRSPAAVS
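
The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the sequences are pasted as plain strings with the 10-character spacing shown above, and it uses the standard codon assignments, which translation table 11 shares with the standard code for elongation codons (the two tables differ in permitted start codons). The names `translate` and `gc_fraction` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X marks an unrecognised codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGTCTCAGA ATTTCAAGCC TTTAAATCCG"))  # MSQNFKPLNP
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_fraction` on the same input can be compared against the GC content field.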