Gene Acid345_0843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0843 
Symbol 
ID4070976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1046364 
End bp1048505 
Gene Length2142 bp 
Protein Length713 aa 
Translation table11 
GC content59% 
IMG OID637982852 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_589922 
Protein GI94967874 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAGA AAGTCGAACC CTCCGCCGAA GAAATCGGTC GTCTCCAGCG GTGTATCAGC 
GACCTCATTG GTGTCGTTGC ACTTCCCACC ATGTGGAGCG GCGGCGATCC TGAACAAGTC
GCTCGTACGC TTCTCGAAAC GCTGGAAGGC GTGCTCCAGT TGGACTTTGC TTTCCTTCGC
CTTACGAATC CTCCGTCAGA GCTGCTGCGT ACAAGCGCCC TGACGCCGGA GCAGGTTCGT
GCGATCGTCG CCCGATCTCT GGACGACACC CCCAATGCAG TGTGTCGTGC CGAGCTTAGT
TCGGGACGCG ACATTTCGGC GATCTCCGTG CATCTCGGCT TGCACGGCGA ATTCGGCGTA
CTCATCGTCG GGTCCCAACG CTTCGAGTTC CCCCAGCAAA CCGAACGACT GTTGCTGAAT
GTGGCCGCGA ACCAGGCAGC CATCGGTTTG CAAGAGGCAC GCCGCTTCGG AGAGCAAAAG
CGCATCGCTC GTGAACTTGA CGACAAGGTA GCGCAGCGCA CCCGAGAACT CGCTGACGCG
AACGAGGCCC TTCGCCACGA AATCGAAGAT CGTAAGGAGA TTGAAGCGCG CCTCCTCGAG
AGTAAAGAAC AGCAATACCA CGTCCGAGTG GAGCTGCAAA AAGCGCTCGA TGAAATCCGC
AAGTCCGAAG CGAAGTTGCA CCAGGTCATT GACACCATCC CCACCCTCGC CTGGTGCAAC
CTGCCCGACG GGCCCAATGA GTTCCTCAGC AAACGATGGC ACGAGTACAC CGGACTCTCG
CCGGAAGAAT CGCATGGCTG GGGTTGGCAA ACCGCGTTTC ATCCTGAAGA TCTGCCGGCG
CTGATGAAAA AGTGGATGGA ACTGATCGAG ACCGGAGAAC CGGACGAAAT CGAATCGCGC
CTCCGCCGTT ACGACGGCGT TTATCGATGG TTCCTCATCC GCGTCGAACC CTTCCGAGAT
GAGACCGGAA CCATCGTGCG CTGGTACGGC ACCAGCACCG ATATCGAAGA ACGCAAGCAG
GCAGAAGAAC GATCGCGCCG CAGCGAAGCA TTCCTCGCTG AGGGTCTGAA CCTTGCGCGT
GTAGGAAACT TTTCCTGGCT CGTGGAAACC GACGACATCA AGTGGTCCGA CCAGCTCTAC
CAGATTTTCG AATTTGAGCC CGGCCAGCCG ATAACCTTCG AGAAAATTGG CTCTCGCGTA
CATCCCGACG ACGTGCACAC GTTGTACAAC ATGATCGAAA AGGCCCAGCG TAACGTGAGC
GACTTTGAAT ATGAACATCG CTTGCTCATG CCGGATGGAA GCGTGAAGTA TTTGCGCCTG
GTAGCTCACG CCGGCCGCAA CTCCGAACAC CAAGTCGAGT ACATCGGTGC GGTTCAGGAT
GTAACCCAGC GCCATCTCGC TGATGATGCC TTGGCGCGCG CGCGCTCGCA GTTGGCAAAC
GTGTCGCGGG TCACCAGTCT CGGCGTCCTG ACTGCGTCCA TCGCGCACGA GGTCAATCAG
CCACTTTCGG GCATCATCAC CAACGCCAGT ACTTGCCTCA GGATGCTTTC CGCGGAACCG
CCGAATGTCG AAGGCGCTCG CGAAACCGCA CTGCGCACCA TTCGCGACGG CAATCGCGCC
GCCGACGTCA TCTCCCGATT GCGCACACTG TTCACGAGAA AGGACCGGTC CGCCGAGGCC
GTCGATCTCA ACGACGCGAC CAAAGAGGTG ATCGCACTTG CTTTGAATGA ATTGCACCGC
GGAAAGGTGG TCTTGCGGCC GGAACTCGGA GATGACCTTC CACCCGTCAT CGGTGATCGC
GTCCAACTCC AGCAGGTGAT CATGAACCTC ATGCGCAATG CCTCCGACGC GATGAGCACC
ATCCACGATC GTCCTCGGGA TCTATTGATC CGCACCGAGT CGGACGGCGA GGCCGTACGC
TTGAGCGTCA CGGATTCCGG CGTAGGCTTC GATGCACAAT CCGCCGACCG GCTCTTCGAG
GCCTTCTATA CAACCAAAAA CGATGGTATG GGGATCGGCC TCTCCATCAG CCGCTCCATC
ATCGAGGCCC ACCAGGGGCG GCTCTGGGCA ACACCGAACC AGGGCCCCGG CGCCACCTTC
TGCTTTTCGC TTCCGTGCAG CACTGATACC AAAGTCCAAT AG
 
Protein sequence
MSQKVEPSAE EIGRLQRCIS DLIGVVALPT MWSGGDPEQV ARTLLETLEG VLQLDFAFLR 
LTNPPSELLR TSALTPEQVR AIVARSLDDT PNAVCRAELS SGRDISAISV HLGLHGEFGV
LIVGSQRFEF PQQTERLLLN VAANQAAIGL QEARRFGEQK RIARELDDKV AQRTRELADA
NEALRHEIED RKEIEARLLE SKEQQYHVRV ELQKALDEIR KSEAKLHQVI DTIPTLAWCN
LPDGPNEFLS KRWHEYTGLS PEESHGWGWQ TAFHPEDLPA LMKKWMELIE TGEPDEIESR
LRRYDGVYRW FLIRVEPFRD ETGTIVRWYG TSTDIEERKQ AEERSRRSEA FLAEGLNLAR
VGNFSWLVET DDIKWSDQLY QIFEFEPGQP ITFEKIGSRV HPDDVHTLYN MIEKAQRNVS
DFEYEHRLLM PDGSVKYLRL VAHAGRNSEH QVEYIGAVQD VTQRHLADDA LARARSQLAN
VSRVTSLGVL TASIAHEVNQ PLSGIITNAS TCLRMLSAEP PNVEGARETA LRTIRDGNRA
ADVISRLRTL FTRKDRSAEA VDLNDATKEV IALALNELHR GKVVLRPELG DDLPPVIGDR
VQLQQVIMNL MRNASDAMST IHDRPRDLLI RTESDGEAVR LSVTDSGVGF DAQSADRLFE
AFYTTKNDGM GIGLSISRSI IEAHQGRLWA TPNQGPGATF CFSLPCSTDT KVQ