Gene Acid345_2411 details


Gene Information

Locus tag: Acid345_2411
Symbol:
ID: 4071409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2852489
End bp: 2853982
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 62%
IMG OID: 637984427
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_591486
Protein GI: 94969438
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
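
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers only agree under that convention) and that the final codon is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2852489, 2853982)
print(length)                     # 1494
print(protein_length_aa(length))  # 497
```

Both values match the Gene Length and Protein Length fields of this record.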


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCAGGAAG CCGCAACCAT CCTTCTCGTT GACGACACCG ACGCGCAGCG CTACGCGCTC 
AGCCGGGTGC TCCGGACCGG TGGGTACAGG GTGGAAGAAG CCACCAGCGG AGAAGAGGCG
CTGCAGAAAG CATCGGCGCG CCCTGATCTC ATCCTGCTCG ATGTCGGCCT CCCCGATCTG
GATGGCTACG AGGTCTGTCG CCGCATCAAG AGCGATCCGG CAACACGTTC CATTCCAATC
CTGCAGATGT CCGCATCGTA TGTTTCGCCC CGCCACAAAG TGATGGGCCT GGAAGGCGGA
GCGGACGGAT ACCTCACGCC TCCGCTGGAA GGCCCTGAGT TGTTGGCGAA TGTGAGGGCG
ATGCTGCGGA TGCGCGCCGC CGAAGACCGC GCCGCGCGGC AAACGTCAGA GATTGAAGCA
GCGCGGGCGG AACTTCATGC GGTGCTGCAC AGTCTCGCGG AAGGCTTGTT GCGCATGGAC
CGCGACGGAT GCATCTGTTA CGCGAACAGC GCAGCGGAGC GGTTGCTGGG TTATCCGAGC
GAGACCATCC GCGGTTTCCG CTTTCACGAC CTGGTGCACC GCAATTTGTC GTCATGCGCA
AAGCAATCGT GCCCGGTTGC GAAGTTTTCC GGAGATCTGA AGGGAACGCT GGAGTCGGTT
TTTGTCCGAA AAGACGGATA CCCGCTGACC ATCGAATACA CGGCATCGCC GTATGCAGTC
GGCGGTGAGG TCGAAGGCGT CGTCGTATCG TTCCGCGACA TCGGTGAACG CAAGCGTTCA
GAAGAAGCAC TGCGCGCGAC CGAGAAGCTG GCGAGCACCG GCCGCATGGC TGCGACCATC
GCGCACGAGA TCAATAATCC GCTGGAGGCG ATTACCAACC TCGTTTACCT GATCAGCGTA
TCGCCCAGCC TCGACAACGA CACGCGGCGC TACGTGGACA TGGCGCAGGC GGAACTGTCG
CGCGTGACCC ACATCTCGAA GCAGACGCTC GGGTTTTATC GGCAGTCGAG CAACGCCGGC
GAATTCAGCC TTTCCGACGT CGCCGAAGGC GTGCTGGGTG TGATGGAGCG CAAACTGAAG
GCCGCCGAAA TTGAAGTGGT CCGTCGCTAC CAGGCGAAGA TCATGCTGCG AGGCCACGCC
GGCGAAATGC GGCAGGTGAT CTCGAACTTG ATTCTCAACG CAATGGAGGC GGTCGGCACC
AAGGGGAGAA TCTGGCTGCA CATCCATAAA GGCCGCGACT GGCGAAACGG GCGCGAGGGT
GTGCGCTTCA TGATCAGCGA CACCGGTTCG GGCATTCCCC GCAACAAGCT CAAGCAGATC
TTCGAGCCGT TCTTCACCAC CAAGCAGGAG AAGGGAACCG GCCTGGGGCT GTGGGTGTCG
AACGGGATCG TGCACAAGCA TGGCGGCTAT ATGCGGGTGC GCTCGTCGCA ATCTGGCGCA
GGTCACGGGA CCTGCTTCTC CATCTTCCTG CCTCTGGTGA ATCCGCACGC GTAG
 
Protein sequence
MQEAATILLV DDTDAQRYAL SRVLRTGGYR VEEATSGEEA LQKASARPDL ILLDVGLPDL 
DGYEVCRRIK SDPATRSIPI LQMSASYVSP RHKVMGLEGG ADGYLTPPLE GPELLANVRA
MLRMRAAEDR AARQTSEIEA ARAELHAVLH SLAEGLLRMD RDGCICYANS AAERLLGYPS
ETIRGFRFHD LVHRNLSSCA KQSCPVAKFS GDLKGTLESV FVRKDGYPLT IEYTASPYAV
GGEVEGVVVS FRDIGERKRS EEALRATEKL ASTGRMAATI AHEINNPLEA ITNLVYLISV
SPSLDNDTRR YVDMAQAELS RVTHISKQTL GFYRQSSNAG EFSLSDVAEG VLGVMERKLK
AAEIEVVRRY QAKIMLRGHA GEMRQVISNL ILNAMEAVGT KGRIWLHIHK GRDWRNGREG
VRFMISDTGS GIPRNKLKQI FEPFFTTKQE KGTGLGLWVS NGIVHKHGGY MRVRSSQSGA
GHGTCFSIFL PLVNPHA
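
The gene sequence begins with TTG, yet the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is a permitted start codon and is read as methionine in the initiating position. The following is a minimal sketch of that translation, not the pipeline used to produce this record: the function name `translate_table11` is hypothetical, and the codon table is built from the standard NCBI TCAG ordering.

```python
BASES = "TCAG"
# Amino acids for translation table 11, in NCBI TCAG codon order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Alternative initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_table11(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces used for 10-base grouping
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence in this record.
first_line = "TTGCAGGAAG CCGCAACCAT CCTTCTCGTT GACGACACCG ACGCGCAGCG CTACGCGCTC"
print(translate_table11(first_line))  # MQEAATILLVDDTDAQRYAL
```

The output matches the first 20 residues of the protein sequence listed above.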