Gene Acid345_4457 details

Gene Information

Locus tag: Acid345_4457
Symbol:
ID: 4070940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5287844
End bp: 5289640
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 59%
IMG OID: 637986496
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_593531
Protein GI: 94971483
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
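
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one untranslated stop codon; neither convention is stated on the page, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5287844
end_bp = 5289640
gene_length_bp = 1797
protein_length_aa = 598

# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match gene length
print(remainder == 0)                                # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop match protein length
```

Under these assumptions all three checks agree with the listed values: a 1797 bp span, 599 codons, and 598 amino acids.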


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.235161
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGACGC GCTCCAAAAT CTGGATCGGG ATCGCTGCGC TTCTGGTCGC CGCGCAGGCA 
GTGGTCAGCC TCGGGCTTCC CCATACGGGG ATGCTCCACC TGCCCTTTGG TCTCGAGATA
TCCGCACGTA TGTTCCGCAC CGCTTTCGGG GACCTGGCGC AGGCCATCAT CGTCGGTTTC
GCGGGTTGCG TGATGTTGTT GAACGGCTTT CGCTCCGAAG GGCCGGCCCG AGTGTTCTGG
ACCCTCTTCT CGCTCGGCAT GTTCTTCTGG CTCGCTGACC TCACCATCTG GTCGTACTAC
GAGGTCGTGA TCCAAACCGA CGTGCCGCAA CTCACCCTCG GCGATAGTTA TCTCTTTCTT
CACCTGGTGC CGATGGTCGC AGCGTTGGGC GCCCATCCCG ATCGCCGCGC CTCCGCCATC
GGACGTCAAC GCACCTGGCT CGACTTCGCG GTACTGCTCA CTTACTGGCT CTACATCTAT
GCGCTCATCG TGATGCCGCA CCAGTACGTA AAGCCCGATA TCCCGACCTA TAACTACAAC
TTCGACATCA TCGACAAGTG CGGCCACTGG ATATTGGTCG TCGGACTTGC GGCGGCTTTC
GTGCGATCCC GAGGCTCGTG GCGGCGCATC TTCCTGATGT TTACGCTCGC GTCGCTGTCG
TACGCCGTGT TCTCCGATTT CGCAAACCTG GCCGTAGATA CCGGCAGCTA TTACACCGGC
TCGGTGTATG ACATCCTGCT CATGGCGACC ATGGCGTTGT TTGCTCTGAC CGCGATCGAA
GGCAGCAAGA TGCCGGCTAC GCCGGATGCC GAGTTGGCGC CGGCAGCCGA GCCGCCATTG
GCGGGGTGGT CGATACAGTG GCCGGCAGTC AGCAGTACGC TGGTTACTCT TTCCATGCCG
GCCATCGGAA TTTATTTGCT GAATTATGCG CCGACGATGG ATCCGGAGAT CCGCGCGTTT
CGCTTAATCG TCACGTTCAT CGCGATGGTG TTGTTGTTCT CGCTGGTTTC GTTGAAGCAG
ACACTGCTGC AGGCTGACCT TGTAGGCTCG CTCAAGAATG TCTCCGATGC CTACAGCGAT
TTGAAGAGCG TGAAGAACCA GTTGGTACAG AGCGAGAAGC TCGCATCGAT GGGGCGGTTG
CTTGCCGGCG CGGCGCACGA GATCAACAAT CCGCTGACGG CGATCCTGGG GTACTCGGAC
TTGCTCACGT CAAGCATTTC ACTCGATCCG CAGACCCGCA GTATGGCCGA GAAAATCGGA
CAGCAGGCGA GGCGCACCAA GACGTTGGTG GAAGACCTGC TGAAGTTCTC GCAGGAAACG
CCGACCCAGC GTTCCTCGAA TGACGTACAG GTGCTGGTGC TCAATGCGAT TAAACTCGCG
GGACTCGAGG CGGGGAAGAG CGTGAAAGTG GAAGTCACCG CCCCAGATAA ACTCCCGCCG
GTCGCGGTGG ATCCCGGACA GATCCTTCAG GTGTTCGTGC ACCTGATCCG CAACGCCGCT
GACGCGATGA GCGAATCGGT GGTGCGCGTG CTTCATATCT CAACGCGCGC AGGAAGTTCG
CAGGTGCAAG TGGAGTTCGC GGACTCCGGT CCCGGCGTGA AGGATCCGGA CCTTGTCTTC
GATCCGTTTT ACACGACGAA GTCGCCGGGC AAGGGAACGG GACTTGGGCT CAGTGCGTGT
TACGGCATCG TGCAAAAACA CGGTGGGCAG ATCACGTGCG CCAATCGCCC GCAAGGCGGT
GCGATCTTTA CCGTGACGCT ACCCACTGTC GAACAGGTTG AAATGCAGAA CGCCTGA
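
The GC content listed in the gene information is presumably the fraction of G and C bases in this sequence, although the page does not say how it is computed. A minimal sketch of that calculation, using a hypothetical helper `gc_content` that ignores the spaces and line breaks used in the listing above:

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Example on the first 10-base block of the gene sequence only.
first_block = "GTGACGACGC"
print(f"{gc_content(first_block):.0%}")  # prints 70%
```

Passing the full gene sequence as a single string would give the value for the whole gene, which can then be compared with the percentage in the record.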
 
Protein sequence
MTTRSKIWIG IAALLVAAQA VVSLGLPHTG MLHLPFGLEI SARMFRTAFG DLAQAIIVGF 
AGCVMLLNGF RSEGPARVFW TLFSLGMFFW LADLTIWSYY EVVIQTDVPQ LTLGDSYLFL
HLVPMVAALG AHPDRRASAI GRQRTWLDFA VLLTYWLYIY ALIVMPHQYV KPDIPTYNYN
FDIIDKCGHW ILVVGLAAAF VRSRGSWRRI FLMFTLASLS YAVFSDFANL AVDTGSYYTG
SVYDILLMAT MALFALTAIE GSKMPATPDA ELAPAAEPPL AGWSIQWPAV SSTLVTLSMP
AIGIYLLNYA PTMDPEIRAF RLIVTFIAMV LLFSLVSLKQ TLLQADLVGS LKNVSDAYSD
LKSVKNQLVQ SEKLASMGRL LAGAAHEINN PLTAILGYSD LLTSSISLDP QTRSMAEKIG
QQARRTKTLV EDLLKFSQET PTQRSSNDVQ VLVLNAIKLA GLEAGKSVKV EVTAPDKLPP
VAVDPGQILQ VFVHLIRNAA DAMSESVVRV LHISTRAGSS QVQVEFADSG PGVKDPDLVF
DPFYTTKSPG KGTGLGLSAC YGIVQKHGGQ ITCANRPQGG AIFTVTLPTV EQVEMQNA