Gene Acid345_3519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3519 
Symbol 
ID4072778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4159954 
End bp4162917 
Gene Length2964 bp 
Protein Length987 aa 
Translation table11 
GC content63% 
IMG OID637985542 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_592594 
Protein GI94970546 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG2203] FOG: GAF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGGAT TAAATCCTGA ACTCGCTGCG CACGTCGATG CGTCCGAACA CGCCATTCGT 
CTTCTCGCGC GCGAAAACAA AATTCTCGAG TTCATAGCGC GCGGCGCCTC TCTGGAGAAG
GTGTTAACAG AGATTGCGCA CGCCGCCGAG GAGTACTGCG ACACCGAAGT TCGTTGCTCG
ATCCTCTTGC TCGACGAGTC AGGAACGCGC TTGATGCACG GCGCTGCGCC TAGCCTTCCC
GATACATATA ACAAGCTGAT TCATGGCACC GCGATTGGTC CCGAAGTCGG CTCCTGCGGC
AAGGCCGCGT TCTGCAAGAA GCCGGTTTTC GTCGAAGACA TTGACACTGC GCCCTCCTGG
TCCGCGCTGA AGCACCTCGC GTTGCCGCTT GGACTGCGCG GCGCCTGGTC CATGCCGATC
TTTGCATCGA ATGGCGATGT TCTTGGCACG CTCGGTATCT ACACCCTTGA ACCTGCGTCA
CCGACCGACC GGGCGCGCCA GGCTATCGAT CTTCTCGCTC GCACTGCGGG CATCGCAATC
GAGCGTCATC GCTCCGAATC GCAGCGGCTG CGCTATCAGA AACAAATTGA GACGCTCAAC
GACACGGGCA TTCTCCTTGC TGCCGAGCGC GACCTCCACA AGATCGTGCA GGCGGCAACG
GACACCGCGC GAGAATTCAG CGGTGCCGCT TTCGGAGCGT TTTTCTACAA CGAAATCCGC
GCCGATGGCG AGAGCTACAT GTTGTACACG CTCTCTGGCG CACCCCGCGA AGCCTTCGAG
AAATTTCCCA TGCCGCGCAA TACCGCCGTC TTCGGCCCAA CGTTCGCCGG CGAAGGCACG
GTGCGGCTGG CGGATGTGCG CAAGGATCCG CGATATGGCA AGAACGCTCC CTACCACGGC
ATGCCGGAAG GGCACCTGCC CGTGTGCAGC TATCTCGCGG TGCCGGTGGT TTCGCGCTCG
GGGAAGGTGC TCGGCGGACT GTTCTTTGGC CATTCTGAGC CGAACCGGTT CACCCTGGAA
GCACAGCACC TGGTCGAGAG CATCGCCGCG CAGGCAGCGG CCGCCATCGA CAACGCACAG
CTCAACGATC GTATTGCGAG ACAGTTGGCT TCATCCGAGG AAGTGCAGCA GCGGCTCGCT
ATCGCCCAGC AGGCGGCGCA GCTCGCGACC TGGGAACTGG ATTTCCGTAC CGATGAAATC
CGGTTCTCGC CGGGCAGCTG GCCGGTGTTC GGGTGCGACC CGTCGGAGAT CAAGAGCCGC
GCCGATTGGG AGCGTCAAAT CCATCCCGAC GACCGCGACA TCGTCCGCAA CGAGCTCGAG
AGCTGCGTCC AGAATGCCAA AGCCTACTTC GTGGAGTATC GCGTCCAGAG CCCGCTCGGT
GTGCGATGGG TGCAGGGGCG CGGCCACGTG GTGTATCACG CGGAAACCGC GCGCCCGGAG
CGCCTGATCG TCCTCAGCAT CGACATAACC GAACGCAAGC TTGCCGACGA AGCCCTGCGC
ATCAGCGACC AGAAGTTCCG CGAGGCGCAG AAGGCCGCCA ACATGGGCAC CTGGTTCTGG
GACATCCCGA CCGACAAAGT CACGTGGTCC ATGGAGGTCC CTTCCTTCGA CGCTGCAGTC
TCCGCCGACC GGCTGAAGAA CTGGGTGAAC GCGGTCCACC CCGACGACCG CCCCGCCGTT
GTTGCCGAAC TCGATCGTGC GCTTCGTCAG GGCGGCCCCT TCAAGATCGA GCATCGCCTT
ATCAGGCAAG ACGGCGCTCA GCGGTGGTCG TTCACGCAAG GCCAGATCAT GCTGGGTGAG
GATGGCACAG CGCTTTCGGG TCTCGGCATC ACCATGGACA CGACCGCGCG GCGCGAGGCG
GAGACGGAAC TCAAGCGTGC CGAGGAGCGC TTCAACCTCG CCGTGGACGC CGCTGACCTG
GGCTTCTGGT ATTGCGACTT GCCGTTCGAC GTTCTCGGTT GGGATGAGCG CGTGAAGGAG
CACTTCTGGT TGCCGCCCGA CGCCAAAGTC ACGGTCGACG ATTTCTATCG GATCCTCCAT
CCCGAGGACC GCGAACGAAC CCGCCAGGCG ATCGAAACCT CCATCAACCA AAAGAAGCGT
TACGACGTTG ACCATCGCGC TGTCTCGCCG ACAGGCGAGG TGCGCTGGGT GCGCGCGGTG
GGCCGCGGGT TCTACGACGA GACCGGGAAC CCGGTGCGCT TCGACGGCGT GACCATGGAC
ATCACCGAGC GGCGCAAAGC GGAAGAGGCG TTGCGCAGTT CCGAGAAGCT CGCCGCGACC
GGACGCCTTG CTGCGACCAT CGCGCACGAG ATCAACAATC CTCTCGAAGC GGTCACGAAC
TTCATCTATC TCGCCAAGAC GACCGACGGC GTCTCAGACC AGGTTCGCTC CTATCTGGAG
ATTGCTGACC AGGAACTGGG CCGCGTATCG CACATCGCGC GGCAGACGCT CGGCTTCTAT
CGCGACAGCA GCGGCCCAAT CCTGATGAGC GTTCCCGACA TCGTGCAGGA CGTCGTGAAC
CTCTACCAGC GCAAGCTGCT CTACAAGTCG CTCGAACTCA AGCTCGACGT GCAGTCGGAC
CTCACGATCC GCGGCCTCGC CGGCGAAATG CGCCAGGTGC TTGCGAATTT GCTCGTGAAC
GCGATCGACG CGTCGAACGA CGGCGGCCGC ATCTGGATCC GAGCGCGCCG CGTGGTAGAC
CTCAAGACCG GCGGCAAAGC GGTGCGGCTC ACCGTGGGCG ATTCAGGCAT CGGCATGAAT
GAAGAAGTTC GCAAAAAACT TTTCACGCCG TTCTTCACCA CCAAGTCCGA CGTCGGCACC
GGCCTCGGCC TGTGGGTCAC GCGCGGCATG GTCGAGAAAG CGAAGGGAAG AATCCGGGTG
CGCAGCCGCC AGGGGATTGG CACCGTTTTC TCCATGCTGT TTCCGTCAAC GAAGTATCCG
CCGCCGTCGG TGCAGCCGGC GTGA
 
Protein sequence
MSGLNPELAA HVDASEHAIR LLARENKILE FIARGASLEK VLTEIAHAAE EYCDTEVRCS 
ILLLDESGTR LMHGAAPSLP DTYNKLIHGT AIGPEVGSCG KAAFCKKPVF VEDIDTAPSW
SALKHLALPL GLRGAWSMPI FASNGDVLGT LGIYTLEPAS PTDRARQAID LLARTAGIAI
ERHRSESQRL RYQKQIETLN DTGILLAAER DLHKIVQAAT DTAREFSGAA FGAFFYNEIR
ADGESYMLYT LSGAPREAFE KFPMPRNTAV FGPTFAGEGT VRLADVRKDP RYGKNAPYHG
MPEGHLPVCS YLAVPVVSRS GKVLGGLFFG HSEPNRFTLE AQHLVESIAA QAAAAIDNAQ
LNDRIARQLA SSEEVQQRLA IAQQAAQLAT WELDFRTDEI RFSPGSWPVF GCDPSEIKSR
ADWERQIHPD DRDIVRNELE SCVQNAKAYF VEYRVQSPLG VRWVQGRGHV VYHAETARPE
RLIVLSIDIT ERKLADEALR ISDQKFREAQ KAANMGTWFW DIPTDKVTWS MEVPSFDAAV
SADRLKNWVN AVHPDDRPAV VAELDRALRQ GGPFKIEHRL IRQDGAQRWS FTQGQIMLGE
DGTALSGLGI TMDTTARREA ETELKRAEER FNLAVDAADL GFWYCDLPFD VLGWDERVKE
HFWLPPDAKV TVDDFYRILH PEDRERTRQA IETSINQKKR YDVDHRAVSP TGEVRWVRAV
GRGFYDETGN PVRFDGVTMD ITERRKAEEA LRSSEKLAAT GRLAATIAHE INNPLEAVTN
FIYLAKTTDG VSDQVRSYLE IADQELGRVS HIARQTLGFY RDSSGPILMS VPDIVQDVVN
LYQRKLLYKS LELKLDVQSD LTIRGLAGEM RQVLANLLVN AIDASNDGGR IWIRARRVVD
LKTGGKAVRL TVGDSGIGMN EEVRKKLFTP FFTTKSDVGT GLGLWVTRGM VEKAKGRIRV
RSRQGIGTVF SMLFPSTKYP PPSVQPA