Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3503 |
Symbol | |
ID | 4072762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4133242 |
End bp | 4136373 |
Gene Length | 3132 bp |
Protein Length | 1043 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637985526 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_592578 |
Protein GI | 94970530 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2202] FOG: PAS/PAC domain [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACCAG AAACCATCTC TTTCGATCGT CAGCCTTCGC CGGGTATGGA ACGCCGCGCG CGCTCATTCC GACCGGCCGT GGCATTTGCA TTTCTCGCGG GCATGGCGAT CATCCTGATC GTGGGTTGGC TCAGCTATCG CACCACGACC ACCCTGATCG AAGACAGTTC CTGGACATCG CATACCCAGG ATGTAATTAC CAATCTCGAC CAACTGCAAT CCACCCTGGA GCACGCCGAG TCATCCCAAG GGCGTTACCT CATCACCGGC GACGAAACAT TCCTGAAGAA CTATGAATTG GACGTCCGGA CCACGCGTGA GCTTAGCCGG AACCTGTTGC AACTGACTTC CGATAATGCC ACGCAGCAGT TGCGCTTAAA GGACCTCCAA GGGCTGATCG AACAAAAAAT CGATCACATG AATTTGACCC TCGTTTTGAG GCGAAATAAC GGATTGACGG AGGATGTGCA ACGTCGTGTC GCGACCGAAG GCAAGCGTCG GATGGATGAG ATCCAGACGA AGGTCTCGGA AGGCGTCGCA CTTGAAACCA GGCTTCTGAG CGTACGGATC GAGGCGCAAC GCCGCAGCGC GGGGAAATCA CTCCGAAGCA TCCTCACCGG CGGATTGCTC GCAATGTTGT TCCTAGCGGC GGCGGGACTG GTCCTGCAGC GCGATATCCA GAAACGATTT GCAGTCGAGC GGCAACTCCA GCGCACAACC GCCCTGCAGC GAGCGGTGCT GAACAGCGCG AATTACGCCA TCATCTCCAC CGACACCTCC GGCACGATCA TCAGTTTCAA CTCGGCGGCT GAACAGATGC TTGGCTATCA CGCGAGTGAA GTAGTCGGCC GGCTCGCCCC GGAAAAGCTC CACGACCCTA CGGAACTCGA ACAGCACGCC GAACAAATGA GCCGGTTCTT CGGACAGAGC ATCTCCGCCG GCTTCGAATC TCTGATCGCA AAGGCTCGAT TAGGAACCAT CGACGAAAGC GAATGGACCT ATGTCCGCCG TGACGGTTCC CGCTTCTTTG GCCTGCTCTC CACCAGCGCG ATGCACGATG AAAACGGCGC TATCACCGGG TACGTATTCA TCGTCAGCGA TGTCACCCGG CGCAAAGATG CGGAGAAGGC CAAGAGCCAA ATCGAACGGC GCTACCGTGC GTTGCTGCAA AACAGTAGCG ACATGGTCGC CGTAATCGAC GCGGCTGGAC ACTTGCAGTA CATTAGCCCG GCAGTCGAAA GGCTGCTGGA ATTCGAAGTA CAGGAACTAG TCGGCCGCGA GATCTTCGAC ATCATTCATC CCGCGGACGT GGAAACCGCG CGGACCTCTT TCTACTCGAT CGCTTTGACT CCGGGTTACT CTGCTCCGCA GGAACTGCGG TTGCGTCGCG CCGACGGCGA ATACCTAACC ACGGAGATTG TCGCCAACAA CCTCCTCACC GACGAAGTGC TGCACGGCAT CGTTTTAAAT GCTCGCGACA TCACCGAGCG CAGCCGCGCC CGGGCACAAC TCGAAGTGCA GAACGCCGTT GCTCGTGTGT TGGCGGAAGC GGAGAACCTC GACCAATCGA TTCCCGAGAT CTTGCAGGCT CTCTGTAACA ACCTTGACTG GGAACTGAGT GAATTTTGGG GAGTAGATCC TGAACAAGAC TCGATGACCT TCAACTTCGC GTGGTCGCTT CCTGGAATCG ATCTGAGCGA GTTCCTCGAT ATCAGCCAGC ACACCCGCAT CCAGCGCGGC GAGGGACTCG CCGGCCGAGT TTGGGAGAAG GCGACAGCCA TCCAGGTTCC AGACATCACG CAGGAAGAGA ATTTCGTTCG CAAGATCGAA GTCGAAGCAC TTTCGCTGAA GACAGCTGTC GGCTTTCCCA TCCGTTCGCG CGAAGGCGTG ATCGGCGTGT TCACCCTGTT CAGCATGCGG CATCGTCACG TGGACAACCA CCTCCTCTCG ATGCTGAATA CGGTGGGAGC GCAAATCGGC CAATTCATTG CACGCAAGCG CGCCGAACAG GAAATTACCC AGAACGAGGA TCGTTACCAC TACCTGTTTG AGAATTCGGC GGACTTGATC CTCACCTTTG GGACTGACGG CACGATCCTG CATCCGAACT CCACGTGGAT GAGCACGCTG GGATATTCCC GCGAGGAACT CCTGAAAAAG CCGCTCTTCG ACCTCATCGG TCCCGAAGAC CGCGAACGCT GCAAAGCGAT CATCGGAATG ATCGTGAGGA GCGGCAGCAC GGACAAGGTT GAGCTCACCT TCCGATCGCA GGATGGTCGC AAGATCGTCG TCGAAGGCAC AATCAGCTGC CGGTACGGCA TAACGGGCGT GGAATATTGC AGTGCGATTT TCCAGGATGT AACCAAGCGG CGTGAAGTCG ATCGTATGAA GAACGAGTTC ATCTCGGTGG TGAGCCATGA ACTACGCACG CCGCTGACAT CTATCCGCGG CTCGCTCGGG CTGCTCGCTG GAGGCGCCTT ACGTAAAGAT CCGGAGAAAG CCGACCGGAT GCTCGACATC GCACTGAAGA ACACCGAGCG ATTGGTGCGG CTCATCAACG ACATTCTCGA CATCGAGAAG ATCGAATCCG GTAACATCGC GCTAAACGTC CAACCGCTCG ATGCCGCAGA TCTGATTTCG CAAGCCAGTG CAACCATGCA TGCCATGGCA GACGCTAACA AGGTTCGGCT GGAGACCCAT TCGACGCGGG GCATCCTTTA TGCCGACCGC GATCGTATGC TTCAAACCCT CACCAACCTG TTAAGCAATG CCATCAAGTT TTCCAAGCCC GACAATACCG TGACGATCAG TTCCCAGCGC CGGGGAGGGG GGCTCCTGAT TCGCGTGCGT GACCAGGGCA GAGGCATTCC GTCTAACAAG CTGCAAACGA TTTTCGAGCG TTTCCAGCAG GTAGATGCGT CGGACTCGCG CGACAAAGGC GGTACAGGTC TTGGCCTGGC GATCTGCCGC AGCATCGTGC AGCAGCACGG CGGATCGATC TGGGTCGACA GCATCGACGG AAAAGGTAGC GAATTCTTTA TCCTGCTTCC CCGCTTCCAG GAAGAAGACG CCTCCATAGT GCAAGCCGAT GCTTCCCCCG GTCCCACTTC CGGGGCCGCC CCTGCAAATT AG
|
Protein sequence | MSPETISFDR QPSPGMERRA RSFRPAVAFA FLAGMAIILI VGWLSYRTTT TLIEDSSWTS HTQDVITNLD QLQSTLEHAE SSQGRYLITG DETFLKNYEL DVRTTRELSR NLLQLTSDNA TQQLRLKDLQ GLIEQKIDHM NLTLVLRRNN GLTEDVQRRV ATEGKRRMDE IQTKVSEGVA LETRLLSVRI EAQRRSAGKS LRSILTGGLL AMLFLAAAGL VLQRDIQKRF AVERQLQRTT ALQRAVLNSA NYAIISTDTS GTIISFNSAA EQMLGYHASE VVGRLAPEKL HDPTELEQHA EQMSRFFGQS ISAGFESLIA KARLGTIDES EWTYVRRDGS RFFGLLSTSA MHDENGAITG YVFIVSDVTR RKDAEKAKSQ IERRYRALLQ NSSDMVAVID AAGHLQYISP AVERLLEFEV QELVGREIFD IIHPADVETA RTSFYSIALT PGYSAPQELR LRRADGEYLT TEIVANNLLT DEVLHGIVLN ARDITERSRA RAQLEVQNAV ARVLAEAENL DQSIPEILQA LCNNLDWELS EFWGVDPEQD SMTFNFAWSL PGIDLSEFLD ISQHTRIQRG EGLAGRVWEK ATAIQVPDIT QEENFVRKIE VEALSLKTAV GFPIRSREGV IGVFTLFSMR HRHVDNHLLS MLNTVGAQIG QFIARKRAEQ EITQNEDRYH YLFENSADLI LTFGTDGTIL HPNSTWMSTL GYSREELLKK PLFDLIGPED RERCKAIIGM IVRSGSTDKV ELTFRSQDGR KIVVEGTISC RYGITGVEYC SAIFQDVTKR REVDRMKNEF ISVVSHELRT PLTSIRGSLG LLAGGALRKD PEKADRMLDI ALKNTERLVR LINDILDIEK IESGNIALNV QPLDAADLIS QASATMHAMA DANKVRLETH STRGILYADR DRMLQTLTNL LSNAIKFSKP DNTVTISSQR RGGGLLIRVR DQGRGIPSNK LQTIFERFQQ VDASDSRDKG GTGLGLAICR SIVQQHGGSI WVDSIDGKGS EFFILLPRFQ EEDASIVQAD ASPGPTSGAA PAN
|
| |