Gene Acid345_3600 details


Gene Information

Locus tag: Acid345_3600
Symbol:
ID: 4072822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4258308
End bp: 4260608
Gene Length: 2301 bp
Protein Length: 766 aa
Translation table: 11
GC content: 60%
IMG OID: 637985623
Product: periplasmic sensor hybrid histidine kinase
Protein accession: YP_592675
Protein GI: 94970627
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
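
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Number of residues encoded by a CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(4258308, 4260608)
print(length, protein_length_aa(length))  # prints: 2301 766
```

Both values agree with the Gene Length and Protein Length fields of this record.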


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.35244
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.024803
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAATC GCGTTCCAGC GCGAAGTCGT TTTGCTTCGT GGTTTTCCGG GCAATACTGG 
TGGCGCGTTC CAGTTGCCAC CATTGTCCTA CTACAAGTTA TAACGCTGAT TGCGTTGCAC
TTGGTGGGCC GGGAAATCAT CGTCGAGAAC TGCCTCGACT TCGGGCTCTC TTCGATCGCA
ATCGTCGTTG CCTGGCAGGC GTCGCGGCGC TCTCGCGGCC TCACCCGTGT CGCGTGGATG
ACGTTGTCGC TCTGCTGCTC GTTTTTCGCT CTATCGCTCG CGATCGGCAT CGGCGCGGAA
GTGCTGCCTT ACACCAGGGC GCTTACCGAA CTCTCCGACG AGATCAGCGT TTTCTGGCTG
GCGCCACTCA GCTTCACGCT TTTCCTCGAA CCTGATTTCG AGTTCCGCGG CTTCGATCCC
ATCCACGTCC TCGACTTCAT CCAGCTCGTG ATCTTCTGGG TCGCGGTCTT CTTCATGTTT
CTGTTCCTCC CCATGCATGT CTTCGTCGGC CAGACGCCTC ATTCCTGGCT GCAAGCTACC
TGGGGCGGAA CCTTGGTCTA CGACCTCTCG ATGACCATGC TGTTCAGCCT GCGCGCCATG
CTCACGCGCT CACGGTCCAC ACGCCGCTTC TTCTGGCGTT TTTCGCTCTT TCTGGTTGTC
GGATGCCTTG CCGATCTTTA CGCCAACTAC AACAAGCTGC CGTCGGCTAC GTGGTTCCAA
CTCGTCTGGA CCGCTCTCAA TATCGCTCCT ATCGTCCTCG CAGGCACGTG GACTGAGGAC
GAAACCGACT GGATTGACGC GAAGAGCAAG GCGGGACGCG TGCTAGGGGA CCAACTCTTC
CCGGTGATGA CCGCTTTCCT CGTGCTCATT CTCTCCATGG TCATCGTCCG CGAGCGCCTT
GGCTTCGCCG TGATGATGGT TTCGATTTCG TTCGTCTGCT CCAGCCTGCG CATGGTGGTG
GTGCAGCAGC GCGAACTGCG CATCGCCGCT GATCTACAGG CGGAGATCGC AGAGCGTCGT
CGCGCTGAAC AATTGCTGCG CGAAAACGAA GAACACCTCG AGGAACAAGT CGCCAACCGC
ACCAGCGAAC TGAGCGAGGC CAACTCGCAA CTACGCTCCG AGATTATCGA GCGCCAGCGA
CTGGAAGAAC AGTTGCGCCA GACGCAAAAG ATGGAAGCGA TCGGCACACT CTCCGGCGGC
ATCGCGCACG ACTTCAACAA CCTGCTGACG GTGATTCGCG GTTATGCACG CATGGTGCTC
GATCGCGTCG GGAACGATCC TGAGCTTCGC ACCGACGTTG AGCAGATTGA CGAAGCCGGC
GCCCGCGCCG CCGCGCTCAC CAGCCAGTTG CTGGCCTTCA GCCGTCGTCA GATGCTGCAG
CCGCGCACCA TCAACCTCAA CGGTCTCGTT CGCGATCTGC AGAAGATGCT GCAGCGCCTC
ATCGGCGAGC ACATCGAGCT CACTACGCGT GCGGGCGAAG GACTCGGCGC GGTGAAGGCC
GATCCCGGAC AGATCGAGCA GGTCATCCTG AATCTCGTCG TAAATGCGCG CGACGCCATG
CCCCGGGGCG GATCGCTGGT GCTGGAAACG CGAAACGTCC AGGTGGACGA AGCGTTCGCG
CGCGAGCACG TGGACCTTAC GCCCGGCGAT TACGTGATGC TCGCGGTGCA CGATACCGGC
GTGGGCATGG ACGACGCTAC CAAGGTGCAC ATCTTCGAAC CGTTCTTCAC CACCAAGGAA
CGCGCCCGCG GCACCGGCCT CGGCCTCTCG ATGGTTTATG GCATCGTCAA GCAGAGCGGT
GGAAGCATTG TGGTGGAGAG CCAGCTCGCG AAGGGTTCGA GCTTCAAGAT TTTCCTGCCG
CGGATCCAGG CGCGTACCGA AGTCGAGACG GGCTTGCATC CCGTCGCCGT GGGGAAGGGT
TCGGAAATCG TGTTGCTCGT GGAAGACGAC GAGCAGGTGC GCAACCTGAC GCTCACCATG
CTGCGCCGCT TTGGCTATAC CGTGTACGTG GCCGAGAACG GCACCGAAGC CTTGAAATTT
AACGAAACTC ATAGCGGCGA AATCAATTTG CTGCTCACGG ACGTGGTGAT GCCCGGCGCC
AGCGGTCCTG AAATCGCGGT CAAAATTTCG GCCAGCCGGC CAGGCATCAA GGTGCTCTAC
ATGTCCGGTT ACACTGACGA CGCAATCGGC ACCCACGGCA TTCTCGAAGA GGGCATTTCA
CTACTGCAAA AACCATTTAC CCCCGCGGCG CTCGGCGAGC GCGTACGCGA GGTGTTGGAC
GCCGTTCCGG TCAAAGCCTG A
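
The listing above is grouped in blocks of ten bases, so it needs to be joined before any length or GC calculation. A minimal sketch, using only the first row of the listing as input; `clean_sequence` and `gc_percent` are hypothetical helper names. Running the same functions over the full listing is how one would compare against the Gene Length and GC content fields.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence listing, copied as displayed.
first_row = "ATGTTGAATC GCGTTCCAGC GCGAAGTCGT TTTGCTTCGT GGTTTTCCGG GCAATACTGG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```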
 
Protein sequence
MLNRVPARSR FASWFSGQYW WRVPVATIVL LQVITLIALH LVGREIIVEN CLDFGLSSIA 
IVVAWQASRR SRGLTRVAWM TLSLCCSFFA LSLAIGIGAE VLPYTRALTE LSDEISVFWL
APLSFTLFLE PDFEFRGFDP IHVLDFIQLV IFWVAVFFMF LFLPMHVFVG QTPHSWLQAT
WGGTLVYDLS MTMLFSLRAM LTRSRSTRRF FWRFSLFLVV GCLADLYANY NKLPSATWFQ
LVWTALNIAP IVLAGTWTED ETDWIDAKSK AGRVLGDQLF PVMTAFLVLI LSMVIVRERL
GFAVMMVSIS FVCSSLRMVV VQQRELRIAA DLQAEIAERR RAEQLLRENE EHLEEQVANR
TSELSEANSQ LRSEIIERQR LEEQLRQTQK MEAIGTLSGG IAHDFNNLLT VIRGYARMVL
DRVGNDPELR TDVEQIDEAG ARAAALTSQL LAFSRRQMLQ PRTINLNGLV RDLQKMLQRL
IGEHIELTTR AGEGLGAVKA DPGQIEQVIL NLVVNARDAM PRGGSLVLET RNVQVDEAFA
REHVDLTPGD YVMLAVHDTG VGMDDATKVH IFEPFFTTKE RARGTGLGLS MVYGIVKQSG
GSIVVESQLA KGSSFKIFLP RIQARTEVET GLHPVAVGKG SEIVLLVEDD EQVRNLTLTM
LRRFGYTVYV AENGTEALKF NETHSGEINL LLTDVVMPGA SGPEIAVKIS ASRPGIKVLY
MSGYTDDAIG THGILEEGIS LLQKPFTPAA LGERVREVLD AVPVKA
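
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which codons may serve as start codons), so a standard codon table is enough for this gene, which begins with ATG. The sketch below translates only the first and last rows of the gene listing; `translate` is an illustrative helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(text: str) -> str:
    """Translate a nucleotide listing, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGTTGAATC GCGTTCCAGC GCGAAGTCGT TTTGCTTCGT GGTTTTCCGG GCAATACTGG"
last_row = "GCCGTTCCGG TCAAAGCCTG A"
print(translate(first_row))  # prints: MLNRVPARSRFASWFSGQYW
print(translate(last_row))   # prints: AVPVKA
```

The two outputs match the first twenty and last six residues of the protein sequence listed above.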