Gene Acid345_3845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3845 
Symbol 
ID4070997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4554276 
End bp4555640 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content61% 
IMG OID637985869 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_592919 
Protein GI94970871 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.340802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.895738 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAGA GCCTCTTCGG AAAGATCTTC CTGTGGTTCT GGCTGACGTC GATTTTGCTC 
ACGGTCGCGT CGGTGTTTAT TGCGGTGTTT CTTACGGGCA CGCCGGTGTT GCGGCAGTGG
GCATCGAATT TTGGCGATTT GTATTCGCGC AATGTGGTGC AGTCGTATTT GCTGGGCGGA
GACGCGGGAC TGGACCAGTT CCTGACGTCC ACGGAACAAG CGCGCGAAAT TCAAACGACG
CTGTTCGGGC CCGACGGAGA ACGGATTCGC GGCGGGGCCC TCACGCCGTT TCAGCAGGGA
CTGTTGGAGG AAGCACGCGC GTCGGGACGG ACGGAGTGCC GTATCCGGCT GGCATGGCAG
TGTGCGGCAG TGGTGGATAC GCCGCGCGGG AAGTTTGTTG TAGTCGGGAG GGCGTTGCAT
CCGCGGCGCA TGGTGCGGCA ATTGCCGCTA TCGGCGGTGC TGACGCGCGC GATGCTGATC
CTGCTTTTCG CGAGCGGGTT GTGCTTTGCG CTGGCGCGGT ATATTGCGCG CCCGGTGGAG
GTTTTGCAAC ATGCGACGCG AAGGATCGCA GCGGGAGACT TGAGCGCCCG AGCCGCACCA
GCGCTGGCTC CGAGAAAAGA CGAACTGGTT TCGCTGGCGG AAGAATTCGA TGTGATGGCG
GCGCGCATTG AGGCCCTGAT GCAATCGCAG CGGCAGATGC TGGGAGACAT TTCGCACGAA
CTGCGGTCGC CATTGACGCG GCTGCGCGTC GCGCTGGAAC TCGCGGAAGG CGGCGATGCC
GAGGCGATGA AGCGAATGAA TGCGGATCTG CAGAAGCTGG AGCAGCTGAT CGAACAGGTG
TTGACACTCA ATCGGCTGGA TGCGGGCGAG AAGTGGGTCG AGCTTCAGTC AACGAATTTG
GAAACTGTCT TGAGCGAAGT TGTGCGCGAC GCGAATTATG AAGGACGGGC GCGGAATGTG
AGCGTCGAAC TGAAAGCGGA GCCGCTGACG ATTAAGGCGA ACCCGGCGCT GCTGAAGAGT
TGCGTGGAGA ACATTGTGCG CAACGCATTG CGATATTCAC CCGACAACGG AAGAGTTGAA
GTGGAGGAGA GGGCGGTTTC AGATTCCACC GGAAGGGTCG GGCACCCACA GTGGGTGGAG
ATCAGAGTGC GGGACCACGG GCCGGGCGTG CCGCCGGAGG CGCTGCCGCG ATTGTTTGAG
CCGTTCTATC GCGTGGCGGA GTCGCGGAGT GAGAAGTCGG GGGGAAGAGG GCTGGGGCTG
TCGATTGCAC AGAGAGCCGC GGCGGTGCAT GGGGGAACTG TCGAGGCGAA GAACCGCGAG
GGCGGGGGGC TGGAAGTGCT TGTGAAGTTG CCGGTGCGTT CGTAA
 
Protein sequence
MMKSLFGKIF LWFWLTSILL TVASVFIAVF LTGTPVLRQW ASNFGDLYSR NVVQSYLLGG 
DAGLDQFLTS TEQAREIQTT LFGPDGERIR GGALTPFQQG LLEEARASGR TECRIRLAWQ
CAAVVDTPRG KFVVVGRALH PRRMVRQLPL SAVLTRAMLI LLFASGLCFA LARYIARPVE
VLQHATRRIA AGDLSARAAP ALAPRKDELV SLAEEFDVMA ARIEALMQSQ RQMLGDISHE
LRSPLTRLRV ALELAEGGDA EAMKRMNADL QKLEQLIEQV LTLNRLDAGE KWVELQSTNL
ETVLSEVVRD ANYEGRARNV SVELKAEPLT IKANPALLKS CVENIVRNAL RYSPDNGRVE
VEERAVSDST GRVGHPQWVE IRVRDHGPGV PPEALPRLFE PFYRVAESRS EKSGGRGLGL
SIAQRAAAVH GGTVEAKNRE GGGLEVLVKL PVRS