Gene Acid345_0066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0066 
Symbol 
ID4068979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp65299 
End bp66486 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content58% 
IMG OID637982066 
Producthistidine kinase 
Protein accessionYP_589145 
Protein GI94967097 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.126071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAACC TCGACCGTGA TCTACGCGTT CTGGTACTGG CTCCCACCGG GCGGGATGCG 
GAACTTATAT CCGACCTGCT AACGAGCAAG GGTATCCGTT GCGAGCAGCC GGGAACGTTG
GCGATGGCCC GAATCGAGAT GGCTGCGGGC GTAGGCGCCT TGCTGGTAGC GGAAGAGACG
CTCACATTGC CGGAAATTAC GGCGTGGGCG GCACAGGTAG CGCTACAACC ATCATGGTCG
GACTTCCCAA TCATTCTTTT GACAATGTCC GGAGAGGTTG ACGAGGAGAG TCAAAGGAGA
TCGCTGCTTC GGAAGCCGCT CGGCAACGTG GTACTACTGG AACGGCCGGC GCGGCCTGAG
ACTCTGGTGA GCACAGTGCA GGCAGCTTTG CGCAGCCGTC GCCGGCAGTA CCAGATGCGG
GATTACTTGG CAGTTCAGCG GCTGGCAGAA GAAGCGTTGC GAAAGTCCGA AAAACTGGCG
GTGGTTGGAC GGTTGGCCGC GAGCATGGCT CACGAAATCA ACAACCCTTT GGCATCTGTG
ACCAATCTTC TGTATCTGAT GGGCCAGTCT GGAAGCCTGT GGGAGTGCCA GGAATATGCA
CAGACAGCGG GACGCGAACT CGCGCGTGTT TCCGAAATCG TCACCCAGAC CTTACGGATA
TACCATCAGA CAGGCAGGCC TGAACTGGTT AGCCCAGCAG AAATTGTGAA TTCCGCACTG
AACCTTTTCC GCGCGAGATT GACGGCAGCT CAGATCGTGG TGCAAACGGA TTTTCGCGAG
TGTCGGCCTG TGCTGGCGAT GGGCGCCGAA CTCAGGCAGC TAATCCTCAA CATGATAGGA
AATGCGCTGG ATGCGATGCG AACGGGGGGA ATGCTGAAAA TTCGAGTCGC GGCGAGCAGC
GAACGCAGCA ACGGCTCCCG TCCCGGCGTG CGAGTCACGA TTGCAGATTC CGGATCGGGG
ATTCATCCTG CCATCAAGGA GCAGTTGTTT GAACCTTTTG TCAGCACCAA GGGCGACACC
GGAGCGGGTC TGGGGCTGTG GGTGAGCTCC GAAATCGTCC GTAAACATAG TGGAAAAATC
CAGGTGAGAA GCAGCGTCTC GCCACCCGCT GCGGGGACTG TTTTCTCGAT ATTTCTTCCC
GAGCGGCCGC ACTTCGAATC CAGGGTCTTC TTTGCAAGAG CTGTCTAA
 
Protein sequence
MNNLDRDLRV LVLAPTGRDA ELISDLLTSK GIRCEQPGTL AMARIEMAAG VGALLVAEET 
LTLPEITAWA AQVALQPSWS DFPIILLTMS GEVDEESQRR SLLRKPLGNV VLLERPARPE
TLVSTVQAAL RSRRRQYQMR DYLAVQRLAE EALRKSEKLA VVGRLAASMA HEINNPLASV
TNLLYLMGQS GSLWECQEYA QTAGRELARV SEIVTQTLRI YHQTGRPELV SPAEIVNSAL
NLFRARLTAA QIVVQTDFRE CRPVLAMGAE LRQLILNMIG NALDAMRTGG MLKIRVAASS
ERSNGSRPGV RVTIADSGSG IHPAIKEQLF EPFVSTKGDT GAGLGLWVSS EIVRKHSGKI
QVRSSVSPPA AGTVFSIFLP ERPHFESRVF FARAV