Gene Acid345_2554 details


Gene Information

Locus tag: Acid345_2554
Symbol:
ID: 4072198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3014982
End bp: 3016637
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 58%
IMG OID: 637984571
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_591629
Protein GI: 94969581
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
        [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID: [TIGR00229] PAS domain S-box
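
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are hypothetical and simply hold the values listed above.

```python
# Values copied from the Gene Information record.
start_bp = 3014982
end_bp = 3016637
gene_length_bp = 1656
protein_length_aa = 551

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be the stop codon,
# which does not contribute an amino acid to the protein.
codons = gene_length_bp // 3
coords_match = span == gene_length_bp
protein_matches = gene_length_bp % 3 == 0 and codons - 1 == protein_length_aa

print(span, codons, coords_match, protein_matches)
```

Under those assumptions the span equals the reported gene length, and the codon count minus one stop codon equals the reported protein length.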


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.344016
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTACGTTT GGATCGCAGC CACGCTGACC GTGGCAGCGC TGCTGTGCTG GGTGGTGACT 
GTCCGTCATC GCGCCACACG CCTTTCCCGG CGTATTGACG AATTGCAACG CGAAATGGGT
GTTCGCGAAT GCATCGAACG CCAACTCCAG GCCAGCCAGT CGAACCTCCA GATGGCTACG
GATGCGTCCG GTGTTGGGAC CTGGTATTGG GATTTAGCAT CCGACGAACT CTCTTGTTCT
GACCAGGCGG TAAGGCTGTT CGGACTCACG AACACTGGCA AGATCTTCAA GTCACCCGAT
TTCGTAAACG CCATTCACCC CGACGATCAG GACCGTGTTC TTGAAACGGT TGCGAACGCA
TTCCGCGAGG CGAGCAAATA TCACGTGGAG TACCGCGTCT TCCGGCCGGA TACGAGCGTC
ATATGGCTGG CCACTTCCGG CCGCGCGCTG AAAGATCCCG ATGGCGAAGT ACGACGGGTT
GCGGGCATCA CAGTGGATGT GACCGACCGA AAACGCCTGG AGGAGAGCTT CTATCAGGCG
CAGAAAATGG AAGCGGTTGG CCGACTCGCG GGTGGCGTGG CGCACGATTT CAATAACATG
CTGGGCGTCA TTTCTGCGTA TGCGGAGCTT TTGCGGGAAG AACCTAGTCT TTCTCTTCGA
GCGAGCAAAC GCGTCGACGA GATCCTGAGT GCGACCCAGC GTGCTAACGC GCTCACGCGT
CAGCTTCTTG CTTTCAGCCG GAAGCAGGTT ATCAGTCCAA CCACTCTCGA TCTGAACGCT
GTTGTATACG GCGTGAAGGA CATGCTGCAG CGGCTTGTTG GGGAAGATGT TCGCATCAGC
GTGATGGCCA CGCCCAAGCT GCCTCCGATC AAGGCGGACC GCGGTCAACT CGAGCAAGTG
CTGCTGAATT TTGCCGCCAA TTCTCGCGAT GCCATGCCCA AGGGAGGCCG CTTCACGCTT
CGGACTTCTC TCGAGCCCGC ACCTGCTGAT CTCCCGCGCC CGCTCACCGG AGACTGTGTG
TGCCTTGAGG TTTCGGATAC TGGCGACGGC ATGAGTCCCG AAGTGATGAA ACACATTTTC
GAGCCGTTCT ACACCACGAA ATCGTCGGGC AAGGGAACTG GTCTCGGCCT GGCAACGGTG
TATGGAGTGG TCGAACAGGC ACACGGCACA ATCCGCGTGG AGAGCGCGCC CGGCAAGGGA
ACTACCTTCC GCGTTTATCT GCCGGCGATG CCGGGCCAGG TCGCCGAAGC TGCCAACACG
GACATGCCGA AGGCACCAGT CTCGGTTAAG GCAAGTGTTC TGCTCGTGGA AGACGAACAA
TCGTTGCGGG AAGTCTTAAC TGAATTTATG CAGACCGCCG GAGTTCAAGT GACGGCGGTG
TCGAGCGGCC GAGAAGCGGT CGCGCGGATC AATTCCGACG CCGAAGTGGA CATTCTGCTT
ACCGACCTGG TGATGCCGGA GATGGATGGC CGCTCACTGG CTCAAATCGC GCGAACCAAG
CGTCCCAGTC TGCAAATCAT CTACATGTCG GGCCACACCA ACGACACGCT CACGCAGAAA
GAACTGGTTG CCGATGGCCT GCCTTATCTG CAGAAGCCAT TCACACGCGC CGACCTCCTC
AAAGCTTTGA CCGCTGTGCT GAAACAGACC GCTTAG
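
The GC content field in the record can be recomputed from a sequence printed in this layout. The following is a minimal sketch, not the database's own method: the helper name gc_content is hypothetical, and it assumes the sequence contains only A, C, G, T plus the spaces and line breaks used for the 10-base blocks.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces between 10-base blocks and the line breaks)
    is removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Example on the first 10-base block of the gene sequence only.
first_block = "GTGTACGTTT"
print(gc_content(first_block))
```

Passing the full gene sequence as one string gives the figure to compare against the reported GC content; a short prefix such as the block above will generally differ from the whole-gene value.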
 
Protein sequence
MYVWIAATLT VAALLCWVVT VRHRATRLSR RIDELQREMG VRECIERQLQ ASQSNLQMAT 
DASGVGTWYW DLASDELSCS DQAVRLFGLT NTGKIFKSPD FVNAIHPDDQ DRVLETVANA
FREASKYHVE YRVFRPDTSV IWLATSGRAL KDPDGEVRRV AGITVDVTDR KRLEESFYQA
QKMEAVGRLA GGVAHDFNNM LGVISAYAEL LREEPSLSLR ASKRVDEILS ATQRANALTR
QLLAFSRKQV ISPTTLDLNA VVYGVKDMLQ RLVGEDVRIS VMATPKLPPI KADRGQLEQV
LLNFAANSRD AMPKGGRFTL RTSLEPAPAD LPRPLTGDCV CLEVSDTGDG MSPEVMKHIF
EPFYTTKSSG KGTGLGLATV YGVVEQAHGT IRVESAPGKG TTFRVYLPAM PGQVAEAANT
DMPKAPVSVK ASVLLVEDEQ SLREVLTEFM QTAGVQVTAV SSGREAVARI NSDAEVDILL
TDLVMPEMDG RSLAQIARTK RPSLQIIYMS GHTNDTLTQK ELVADGLPYL QKPFTRADLL
KALTAVLKQT A