Gene Acid345_0821 details

Gene Information

Locus tag: Acid345_0821
Symbol:
ID: 4072347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1018800
End bp: 1020221
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 53%
IMG OID: 637982830
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_589900
Protein GI: 94967852
COG category: [T] Signal transduction mechanisms
COG ID: [COG4564] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
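
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1018800
end_bp = 1020221

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop codon,
# which does not contribute an amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1422
print(protein_length)  # 473
```

Both results agree with the Gene Length and Protein Length fields of the record.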


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACTTG AAACAAACGA AGAGGTCAAG GCTCGCTTCT CCGACGCCTT CAAAGCAAGT 
CCCGTCGGAA TCGTCGTCGA GAACTTAGAT GGGCAGCCGC TCTTTGTGAA TCCAGCATTC
TGTTCGATGT TGGGATTTAC CGAGGAAGAG TTGCGCAATA AGCATTGTGT GGATTTTTCT
CCGCAGGAAG ATGCTCAAAA GGACTGGGTC CTGTTTCAGC AGCTGCGCGA AGGATCAATC
GACCATTATC AAATCGATAA ACGCTATTTC CGAAGCGACG GTACAACCGT CTGGGGACGC
CTCACTGTTT CGTTGCTTCG TGGCAAACCC TCTCCGCTTG TGCTTGCGAT GGTGGACGAC
ATCACTGAAC GTAAGCTTGC CGAAGAAGCC CTCAGTGAGA GCGAAACACG GTTCCGCAGC
ATCTTTCGAG ATGCTGGAAT CGGCATGGTG ATCGTTTCTC TTGATGGGCG TTATCTCTCT
GCCAATCGCG CCTTCTGCGA CTATCTGGGA TATTCAGAGC AAGAAATTCT TGGCAAGACG
ATTGAATCGA TCACTCTCCC GGCGGATTGG CCGGCGTTTT CAGAAAAGAT GCGCGAAGCA
CTCTCGGGTC AAGGATTTCA GTGGTTACAT AAACGCTGCC TGCACAAGAG TGGCCGAATC
GTCTACACGG AGACTAGTTC ATCAGTCATT CGCGATCGTG AAGGCGTCGC GCGGTATTTC
GTAGCCCAGA TTCTGGACAT TACGGGTCGC AAGGAAGCCG AAGAAGCGCT TTCAGCCATG
ACGCGAAAGC TGATAGAAGC TCAAGAGCAA GAACGCGCCC GCATTGCGCG CGAGCTCCAT
GATGACATCA GTCAACGTTT GGCTGTGCTG GCGATTGATC TCGATGGCCG GGAAGGTGTT
CCTCAAGAGG TTCAATCCCA TCTGGAAAAA TTTCGGCTCC AGGTGGTTGA AATTGCCAAT
GACGTGCACG GCTTGAGTCA CGAGTTGCAT TCTTCGAAGC TGGAGCTTCT TGGAGTGGTC
ACAGCTATGA GGAGCTGGTG TCAAGAAGTG GGTCGTCGGC AAAAAATCGA CGTCGATTTC
GCGAGCGATG TTTCGACCTC AATCCCTCCG GAACTCGGCC TTTCGCTTCT GCGGGTGCTC
CAAGAGGCAC TCCATAACGC CACCCGACAC AGCGGGACGC AGCGAGTTGA AGTACGGCTG
GAAGAACGCT CGAATGAACT CCATCTCCTG GTTCGCGACT CGGGTAAAGG GTTTGATGTC
GAATCGGCAC TGCGAAGTCA AGGCCTCGGC CTTACGAGCA TGCGTGAACG CGTTCGTTTG
CTAAATGGCA GGTTGACCGT TCAATCGCAG GCCCGTTGCG GCACTGAAGT TCATGCCCGC
GTCCCCTTCA ACCCCGATCA AAATCGGTCT TTGAAAGATT GA
 
Protein sequence
MQLETNEEVK ARFSDAFKAS PVGIVVENLD GQPLFVNPAF CSMLGFTEEE LRNKHCVDFS 
PQEDAQKDWV LFQQLREGSI DHYQIDKRYF RSDGTTVWGR LTVSLLRGKP SPLVLAMVDD
ITERKLAEEA LSESETRFRS IFRDAGIGMV IVSLDGRYLS ANRAFCDYLG YSEQEILGKT
IESITLPADW PAFSEKMREA LSGQGFQWLH KRCLHKSGRI VYTETSSSVI RDREGVARYF
VAQILDITGR KEAEEALSAM TRKLIEAQEQ ERARIARELH DDISQRLAVL AIDLDGREGV
PQEVQSHLEK FRLQVVEIAN DVHGLSHELH SSKLELLGVV TAMRSWCQEV GRRQKIDVDF
ASDVSTSIPP ELGLSLLRVL QEALHNATRH SGTQRVEVRL EERSNELHLL VRDSGKGFDV
ESALRSQGLG LTSMRERVRL LNGRLTVQSQ ARCGTEVHAR VPFNPDQNRS LKD
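
The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal sketch of how the blocked gene sequence could be cleaned and translated to compare it with the protein sequence. It uses only the Python standard library; the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons), so a standard codon table is used here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(blocked):
    """Remove the spaces and line breaks used for ten-character blocking."""
    return "".join(blocked.split()).upper()

def translate(dna):
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)

# First and last lines of the gene sequence above; both begin in frame
# because every full line holds 60 bases, a multiple of three.
first_line = clean("ATGCAACTTG AAACAAACGA AGAGGTCAAG GCTCGCTTCT CCGACGCCTT CAAAGCAAGT")
last_line = clean("GTCCCCTTCA ACCCCGATCA AAATCGGTCT TTGAAAGATT GA")

print(translate(first_line))  # MQLETNEEVKARFSDAFKAS
print(translate(last_line))   # VPFNPDQNRSLKD
```

The two translations match the first 20 and the last 13 residues of the protein sequence above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare with the GC content field.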