Gene Acid345_3503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3503 
Symbol 
ID4072762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4133242 
End bp4136373 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content57% 
IMG OID637985526 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_592578 
Protein GI94970530 
COG category[T] Signal transduction mechanisms 
COG ID[COG2202] FOG: PAS/PAC domain
[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCAG AAACCATCTC TTTCGATCGT CAGCCTTCGC CGGGTATGGA ACGCCGCGCG 
CGCTCATTCC GACCGGCCGT GGCATTTGCA TTTCTCGCGG GCATGGCGAT CATCCTGATC
GTGGGTTGGC TCAGCTATCG CACCACGACC ACCCTGATCG AAGACAGTTC CTGGACATCG
CATACCCAGG ATGTAATTAC CAATCTCGAC CAACTGCAAT CCACCCTGGA GCACGCCGAG
TCATCCCAAG GGCGTTACCT CATCACCGGC GACGAAACAT TCCTGAAGAA CTATGAATTG
GACGTCCGGA CCACGCGTGA GCTTAGCCGG AACCTGTTGC AACTGACTTC CGATAATGCC
ACGCAGCAGT TGCGCTTAAA GGACCTCCAA GGGCTGATCG AACAAAAAAT CGATCACATG
AATTTGACCC TCGTTTTGAG GCGAAATAAC GGATTGACGG AGGATGTGCA ACGTCGTGTC
GCGACCGAAG GCAAGCGTCG GATGGATGAG ATCCAGACGA AGGTCTCGGA AGGCGTCGCA
CTTGAAACCA GGCTTCTGAG CGTACGGATC GAGGCGCAAC GCCGCAGCGC GGGGAAATCA
CTCCGAAGCA TCCTCACCGG CGGATTGCTC GCAATGTTGT TCCTAGCGGC GGCGGGACTG
GTCCTGCAGC GCGATATCCA GAAACGATTT GCAGTCGAGC GGCAACTCCA GCGCACAACC
GCCCTGCAGC GAGCGGTGCT GAACAGCGCG AATTACGCCA TCATCTCCAC CGACACCTCC
GGCACGATCA TCAGTTTCAA CTCGGCGGCT GAACAGATGC TTGGCTATCA CGCGAGTGAA
GTAGTCGGCC GGCTCGCCCC GGAAAAGCTC CACGACCCTA CGGAACTCGA ACAGCACGCC
GAACAAATGA GCCGGTTCTT CGGACAGAGC ATCTCCGCCG GCTTCGAATC TCTGATCGCA
AAGGCTCGAT TAGGAACCAT CGACGAAAGC GAATGGACCT ATGTCCGCCG TGACGGTTCC
CGCTTCTTTG GCCTGCTCTC CACCAGCGCG ATGCACGATG AAAACGGCGC TATCACCGGG
TACGTATTCA TCGTCAGCGA TGTCACCCGG CGCAAAGATG CGGAGAAGGC CAAGAGCCAA
ATCGAACGGC GCTACCGTGC GTTGCTGCAA AACAGTAGCG ACATGGTCGC CGTAATCGAC
GCGGCTGGAC ACTTGCAGTA CATTAGCCCG GCAGTCGAAA GGCTGCTGGA ATTCGAAGTA
CAGGAACTAG TCGGCCGCGA GATCTTCGAC ATCATTCATC CCGCGGACGT GGAAACCGCG
CGGACCTCTT TCTACTCGAT CGCTTTGACT CCGGGTTACT CTGCTCCGCA GGAACTGCGG
TTGCGTCGCG CCGACGGCGA ATACCTAACC ACGGAGATTG TCGCCAACAA CCTCCTCACC
GACGAAGTGC TGCACGGCAT CGTTTTAAAT GCTCGCGACA TCACCGAGCG CAGCCGCGCC
CGGGCACAAC TCGAAGTGCA GAACGCCGTT GCTCGTGTGT TGGCGGAAGC GGAGAACCTC
GACCAATCGA TTCCCGAGAT CTTGCAGGCT CTCTGTAACA ACCTTGACTG GGAACTGAGT
GAATTTTGGG GAGTAGATCC TGAACAAGAC TCGATGACCT TCAACTTCGC GTGGTCGCTT
CCTGGAATCG ATCTGAGCGA GTTCCTCGAT ATCAGCCAGC ACACCCGCAT CCAGCGCGGC
GAGGGACTCG CCGGCCGAGT TTGGGAGAAG GCGACAGCCA TCCAGGTTCC AGACATCACG
CAGGAAGAGA ATTTCGTTCG CAAGATCGAA GTCGAAGCAC TTTCGCTGAA GACAGCTGTC
GGCTTTCCCA TCCGTTCGCG CGAAGGCGTG ATCGGCGTGT TCACCCTGTT CAGCATGCGG
CATCGTCACG TGGACAACCA CCTCCTCTCG ATGCTGAATA CGGTGGGAGC GCAAATCGGC
CAATTCATTG CACGCAAGCG CGCCGAACAG GAAATTACCC AGAACGAGGA TCGTTACCAC
TACCTGTTTG AGAATTCGGC GGACTTGATC CTCACCTTTG GGACTGACGG CACGATCCTG
CATCCGAACT CCACGTGGAT GAGCACGCTG GGATATTCCC GCGAGGAACT CCTGAAAAAG
CCGCTCTTCG ACCTCATCGG TCCCGAAGAC CGCGAACGCT GCAAAGCGAT CATCGGAATG
ATCGTGAGGA GCGGCAGCAC GGACAAGGTT GAGCTCACCT TCCGATCGCA GGATGGTCGC
AAGATCGTCG TCGAAGGCAC AATCAGCTGC CGGTACGGCA TAACGGGCGT GGAATATTGC
AGTGCGATTT TCCAGGATGT AACCAAGCGG CGTGAAGTCG ATCGTATGAA GAACGAGTTC
ATCTCGGTGG TGAGCCATGA ACTACGCACG CCGCTGACAT CTATCCGCGG CTCGCTCGGG
CTGCTCGCTG GAGGCGCCTT ACGTAAAGAT CCGGAGAAAG CCGACCGGAT GCTCGACATC
GCACTGAAGA ACACCGAGCG ATTGGTGCGG CTCATCAACG ACATTCTCGA CATCGAGAAG
ATCGAATCCG GTAACATCGC GCTAAACGTC CAACCGCTCG ATGCCGCAGA TCTGATTTCG
CAAGCCAGTG CAACCATGCA TGCCATGGCA GACGCTAACA AGGTTCGGCT GGAGACCCAT
TCGACGCGGG GCATCCTTTA TGCCGACCGC GATCGTATGC TTCAAACCCT CACCAACCTG
TTAAGCAATG CCATCAAGTT TTCCAAGCCC GACAATACCG TGACGATCAG TTCCCAGCGC
CGGGGAGGGG GGCTCCTGAT TCGCGTGCGT GACCAGGGCA GAGGCATTCC GTCTAACAAG
CTGCAAACGA TTTTCGAGCG TTTCCAGCAG GTAGATGCGT CGGACTCGCG CGACAAAGGC
GGTACAGGTC TTGGCCTGGC GATCTGCCGC AGCATCGTGC AGCAGCACGG CGGATCGATC
TGGGTCGACA GCATCGACGG AAAAGGTAGC GAATTCTTTA TCCTGCTTCC CCGCTTCCAG
GAAGAAGACG CCTCCATAGT GCAAGCCGAT GCTTCCCCCG GTCCCACTTC CGGGGCCGCC
CCTGCAAATT AG
 
Protein sequence
MSPETISFDR QPSPGMERRA RSFRPAVAFA FLAGMAIILI VGWLSYRTTT TLIEDSSWTS 
HTQDVITNLD QLQSTLEHAE SSQGRYLITG DETFLKNYEL DVRTTRELSR NLLQLTSDNA
TQQLRLKDLQ GLIEQKIDHM NLTLVLRRNN GLTEDVQRRV ATEGKRRMDE IQTKVSEGVA
LETRLLSVRI EAQRRSAGKS LRSILTGGLL AMLFLAAAGL VLQRDIQKRF AVERQLQRTT
ALQRAVLNSA NYAIISTDTS GTIISFNSAA EQMLGYHASE VVGRLAPEKL HDPTELEQHA
EQMSRFFGQS ISAGFESLIA KARLGTIDES EWTYVRRDGS RFFGLLSTSA MHDENGAITG
YVFIVSDVTR RKDAEKAKSQ IERRYRALLQ NSSDMVAVID AAGHLQYISP AVERLLEFEV
QELVGREIFD IIHPADVETA RTSFYSIALT PGYSAPQELR LRRADGEYLT TEIVANNLLT
DEVLHGIVLN ARDITERSRA RAQLEVQNAV ARVLAEAENL DQSIPEILQA LCNNLDWELS
EFWGVDPEQD SMTFNFAWSL PGIDLSEFLD ISQHTRIQRG EGLAGRVWEK ATAIQVPDIT
QEENFVRKIE VEALSLKTAV GFPIRSREGV IGVFTLFSMR HRHVDNHLLS MLNTVGAQIG
QFIARKRAEQ EITQNEDRYH YLFENSADLI LTFGTDGTIL HPNSTWMSTL GYSREELLKK
PLFDLIGPED RERCKAIIGM IVRSGSTDKV ELTFRSQDGR KIVVEGTISC RYGITGVEYC
SAIFQDVTKR REVDRMKNEF ISVVSHELRT PLTSIRGSLG LLAGGALRKD PEKADRMLDI
ALKNTERLVR LINDILDIEK IESGNIALNV QPLDAADLIS QASATMHAMA DANKVRLETH
STRGILYADR DRMLQTLTNL LSNAIKFSKP DNTVTISSQR RGGGLLIRVR DQGRGIPSNK
LQTIFERFQQ VDASDSRDKG GTGLGLAICR SIVQQHGGSI WVDSIDGKGS EFFILLPRFQ
EEDASIVQAD ASPGPTSGAA PAN