Gene Acid345_1659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1659 
Symbol 
ID4069807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2004416 
End bp2005570 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content57% 
IMG OID637983667 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_590734 
Protein GI94968686 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0793424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAAATCG AGCGTTCCAT CCCGCGTCGG GAGAAGGCAG TCTGGCAAGA AAATCCAGAA 
TGGCTCTTTG AAAGTGCACC CGAAGCGATT TTCGTGTTCG ACGACGACTT GCAGTATCTC
GAAGCAAATC CTGCCGCCTG CCGGCTCTTG CGACGGAATA AAGAAGAAAT CGTCGGCTGC
AAGATTGGAA CTTTCTCAGA AGACGGTGCC CGGCTGGCGG AGGCAATACG AAAGACCCCT
GTAGGCGACG GCGTGGAGGC GTCGCACACG CTGATTCTCC CGGATGGCAG TGTGCGAACG
GTTGAGATGG TGACCCGACC GAACATGCTG CCGGGGATCC ACCTTTCGTT CAGCCGCGAT
GTAACAGACC GGAGGCTGCT GGAGCGGGAA TTGGAACATC ACACGCGACT CGAAGCAGCG
GGCAAACTTG CGAGTGGAAT CGCACACGAC TTCAACAATA TGTTGACTGC GATCCTGAGC
TACGCCGACT TGCAACTTCA GCATGTTGAA TCCGGCTCGG CTATGCAGCG CTATGTCTTG
GGAATACAGG CGGCGGCGGA GCGCGCGGCG CAGACGACGC ATCAGTTGTT GGCATTTTGC
CGGCGTCAGA AGATGCAGTT TGCGGAGACG CAGGTCAACG ATGTGATTCA ACAGTCGTCG
GAATTGATTG GGCGGCTCAT TGGTGAAGAC ATTGAGTTGG CGTTCGATCT CGATCGTGGC
ATTCCCGAGG TATGGGCAGA CGCGGGACAG TTGAACCAGG TATTCGTGAA TCTGGCGGTG
AATGCGCGCG ATGCGATGCC GAGGGGCGGA CGACTGCTCT TCGCGACATC CAAGCGCGGG
ACGGAAGAAA AGGATAAACG CGTTTCCATT TTTGTCCACG ACACCGGAAT GGGAATTGGC
GCTGATGTGC TGCCGCACAT CTTCGAGCCG TTTTTCACGA CGAAGGAGCA AGGCAAAGGA
ACCGGTTTGG GACTCGCGAC CGTTTACGGA ATTGTGAAGC AAATGAAGGG CGAGATTCTG
GTTAGCAGTG AACCGGGGCG TGGTACGACG TTTGAGATAG CACTTCCAGC CTCGCGGGCG
GCGGGTGCTT GGGTAGAACC GGTGACCGCG AAGGCGAAGC CGAACAATAT GGAACATCGG
CCATTTGCAG AGTGA
 
Protein sequence
MEIERSIPRR EKAVWQENPE WLFESAPEAI FVFDDDLQYL EANPAACRLL RRNKEEIVGC 
KIGTFSEDGA RLAEAIRKTP VGDGVEASHT LILPDGSVRT VEMVTRPNML PGIHLSFSRD
VTDRRLLERE LEHHTRLEAA GKLASGIAHD FNNMLTAILS YADLQLQHVE SGSAMQRYVL
GIQAAAERAA QTTHQLLAFC RRQKMQFAET QVNDVIQQSS ELIGRLIGED IELAFDLDRG
IPEVWADAGQ LNQVFVNLAV NARDAMPRGG RLLFATSKRG TEEKDKRVSI FVHDTGMGIG
ADVLPHIFEP FFTTKEQGKG TGLGLATVYG IVKQMKGEIL VSSEPGRGTT FEIALPASRA
AGAWVEPVTA KAKPNNMEHR PFAE