Gene Acid345_2603 details


Gene Information

Locus tag: Acid345_2603
Symbol:
ID: 4070566
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3070624
End bp: 3072336
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 59%
IMG OID: 637984620
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_591678
Protein GI: 94969630
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
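
The start, end, gene length, and protein length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends in a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(3070624, 3072336)
print(length)                     # 1713
print(protein_length_aa(length))  # 570
```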


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGAGA CTGACAAATC CGGGGCGTCG CCGCAACCGC TCAAAGCAAG CGGCGACGCC 
ACTATGCAGC GCTGGCAACG ACTTTTGCAG ACGATCAGCG CCAAGTTGCT GGTGTTGCTC
ATCGGAGCGC TGCTCGCAAT CTTCGGCGCG CTCGGCTTCG TCAACATTCG CCTGCACCGG
CAGCATCTCG AATCCGTCAC TCTCGTAGCC GCAGAGCGCA ATAGCGACGT GGTCAAACGC
AGCACTTCCC ATTACATGAT GCGCAATGAC CGCGAAGGCC TGTACGAGAT CATCCGCACC
ATGGCCGACG AGCCCGGCAT CAAGCGCATC CGGATCATCA ATCAGGAAGG TCTAATCCGC
TACTCCAGCG ACTCCGGCGA AGTCAATCGG CAGATCGACA AGAACGCCGA GGCCTGCTAC
GCCTGCCACG TCCAGGCCGC GCCGCTCACG AAACTCAACC GGCCCGATCG CTTCCGTATC
TATCGTGAAA ATGGCGAACG CGTGCTCGGC ATCATCACCC CCATCGAGAA CCAGGAAAGT
TGTTCTAACG CCGCCTGCCA CGCGCATCCG AAAGAACAGC AGATCCTCGG CGTGCTCGAT
GTTGACCTCT CGCTCGCGAA GGCCGATGCC GCCCTCGCCG AAAGCAGTTG GACGATGATC
GGCTATACCC TCATCGCGTC CGTCCTCATC TGCCTCGCGG CCTGGCTCTT CGTTTGGCGC
GTAGTACACG AACCGCTGGG CAAGCTCAAA GCCGGCACGG AGCGCCTCAC TCATGGCGAC
CTCGGCTATC AGCTAGAACT TGAAAATCAA TCGCACGACG AAGTTGGCGA ACTCGCGCAC
TCGTTCAACG AGATGAGCAG CCAGCTTCGC AACGCACGCG ATGAAATCAC GGCATGGAAC
CGCACCCTCG AAGACCGCGT CCTGGAGAAG ACGAGCGAAT TAAAGAAAGC CGCCGAGCGC
ATGCTACACG TAGAAAAGAT GGCGACCATC GGCAAAATGG CAGCCGTCGT TGCGCACGAG
ATCAACAACC CGCTGTCCGG CATCCTCACC TACTCCAAGC TGGTGAAGCG CTGGATCGAG
AAAGGTATCT TCGACGACGA GCCCAAGCGC CACGAGATGG CCGAGAACCT CGATCTCGTC
GCCACGGAGA GCCGTCGCTG CGGCGATCTC GTCAAGAACC TGCTGAGTTT CTCGCGCACG
AATCCGATCA ATCTCGAATG GATCGCGGTC AATCCCATCG TGGACCGGGT CGTAAAGCTC
GCGGCTCACA AGCTCGAGAT GGGCGGCATT CAAATTCACG TTGACACAGC TTCAGACATG
CCAGTAGTGC ATGCGGACGC TGCACAGATT GAACAAGTGC TCCTCGCGCT GACGATGAAC
GCGATTGATG CCATGCCGCA TGGCGGCAAT CTATGGATCG CCACCACCAT CACCGACCAC
AGCGAGCTTC TGCTCCAGGT CAAAGACGAC GGCATGGGAA TCTCATCGGA GATCCTGCCT
CGACTCTTCG AGCCCTTCCT CACTACGAAG GACACCGGCA AGGGCGTCGG GCTAGGCCTG
GCAATCAGCC ACAACATCGT GGAGCGTCAC AACGGCCGCA TCGAAGTCGA CTCCCAAGTC
GGCCGTGGAA CAACTTTCAA CGTTTATCTG CCCCTATCGG ATGAGAGCTA TGAGTTTTCT
CCAGCGGCAG CGAACCAAGA TGCAATGAGG TAA
 
Protein sequence
MPETDKSGAS PQPLKASGDA TMQRWQRLLQ TISAKLLVLL IGALLAIFGA LGFVNIRLHR 
QHLESVTLVA AERNSDVVKR STSHYMMRND REGLYEIIRT MADEPGIKRI RIINQEGLIR
YSSDSGEVNR QIDKNAEACY ACHVQAAPLT KLNRPDRFRI YRENGERVLG IITPIENQES
CSNAACHAHP KEQQILGVLD VDLSLAKADA ALAESSWTMI GYTLIASVLI CLAAWLFVWR
VVHEPLGKLK AGTERLTHGD LGYQLELENQ SHDEVGELAH SFNEMSSQLR NARDEITAWN
RTLEDRVLEK TSELKKAAER MLHVEKMATI GKMAAVVAHE INNPLSGILT YSKLVKRWIE
KGIFDDEPKR HEMAENLDLV ATESRRCGDL VKNLLSFSRT NPINLEWIAV NPIVDRVVKL
AAHKLEMGGI QIHVDTASDM PVVHADAAQI EQVLLALTMN AIDAMPHGGN LWIATTITDH
SELLLQVKDD GMGISSEILP RLFEPFLTTK DTGKGVGLGL AISHNIVERH NGRIEVDSQV
GRGTTFNVYL PLSDESYEFS PAAANQDAMR
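
Both sequences are printed in blocks of 10 characters, 60 per row, so the whitespace has to be removed before computing lengths or base composition. The following is a minimal sketch of that cleanup and of a GC-fraction calculation; the helper names are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCCGGAGA CTGACAAATC CGGGGCGTCG CCGCAACCGC TCAAAGCAAG CGGCGACGCC"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.2%}")   # GC share of this row only
```

Applied to the full gene sequence, the cleaned string should be 1713 characters long, and its GC fraction can be compared with the GC content field.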