Gene Acid345_0475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0475 
Symbol 
ID4069470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp590313 
End bp591674 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content56% 
IMG OID637982479 
Producttwo component, sigma54 specific, Fis family transcriptional regulator 
Protein accessionYP_589554 
Protein GI94967506 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.318452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAAC CATCCCTTCA CATTGTCGTA GTAGACGACG ATCCCGGAAC GTGTGTGTAT 
ATCGAGAGCG TGTTCGCGGA ACTCGGGCAC ACCTGCAAGA GCTTCGTTCG GCCGGAAGCG
GCCGAGGAGT ATATCCTCAC CCACCCGGTG GACCTCGCAA TCGTGGACGT ATACCTCGGC
TCTACGACCG GCGTTGAAGT ATTGCGGCGC TGCCGCGTGC ACCGTCCGAA ACTTTACGCG
GTCATCATCA CCGGGCAAAT CAGCCTTGAA ATGGCGGCAC GATCCATCGC AGAGGGAGCG
GTGGATTACA TTCAGAAGCC CATCGATATC GACGCGTTGC TCAACATCGC GGAACGCGCT
CTGGAGCACA AGGAGCGCAG CGGAGAAACG CCGGCGGAAC AGGAGGAATC CGAGTCGAAA
ATCGTCGGCA GCAGTCCTTC GATGCTCGAG GTGTACAAAC TCATCGCGCG TGTGGCGCCG
AGTAATGCGA ATGTGCTGAT CACCGGCGCA AGCGGTACCG GCAAAGAATT GGTAGCGCGG
GCCATTCACG AGCACTCGAG GCGAGCAGAG ATGCCCTTTA CGCCGGTCAA CTGCGGGTCG
TTCGCCGAGA CGTTACTGGA GAGCGAACTT TTCGGGCACG AGAAAGGCGC GTTCACGGGT
GCCGATTCAG TCAGAAAAGG ATTGATCGAA TCGACGCAGG GCGGAACGCT GTTCCTGGAT
GAGATTACGG AAACGTCCCT CGGCTTCCAG GTGAAGTTGC TTCGTGTTTT GCAGGAGCAA
CAGCTACGCC GTATTGGATC GAACAAGATT ATTCCGATCG ATGTTCGCAT CCTGGCTGCG
ACGAACCGCG ACGTTCCGTC GCTCATCCGC GAAGGCAAAT TCCGCGAAGA TCTTTACTAT
CGGCTCGCCG TGGTACAGAT CAAGATCCCG ATGCTGGCGG AACGACAGTC TGATATCCCG
TCATTAGTCA CGCATTTCCT ACGTCAATTC AATGAGCGCA ATCAGGCGCG GGTGTCGATT
GAGCAGAGCG CGGTGGAATT GCTGCAGAAA CGGAGTTGGC CGGGTAACGT CCGCGAGTTG
GAGAACACAA TTTACCGGCT CGCAATTTTT GCTTCCACAG GAAGGATCAC TGGGGCGGAC
GTCGAACGAG AACAAGAGTC ACAGAAGAAT GGGCCGAAGG CGGAACCTGT CAGTGCTCCC
GACCGGCTCG TCGAGATGGA AAGACATCAG ATCTTGCGCA TTTTGAAAGA CGTTCACGGG
AACAAGAGTG AAGCGGCGAG GCGCCTCGGC ATTGAAAGAA AGACGCTCTA CAAGAAGGCT
GTTCGCCTCG GGATAACCCT CGATGCTTCT GAATATCAAT GA
 
Protein sequence
MPKPSLHIVV VDDDPGTCVY IESVFAELGH TCKSFVRPEA AEEYILTHPV DLAIVDVYLG 
STTGVEVLRR CRVHRPKLYA VIITGQISLE MAARSIAEGA VDYIQKPIDI DALLNIAERA
LEHKERSGET PAEQEESESK IVGSSPSMLE VYKLIARVAP SNANVLITGA SGTGKELVAR
AIHEHSRRAE MPFTPVNCGS FAETLLESEL FGHEKGAFTG ADSVRKGLIE STQGGTLFLD
EITETSLGFQ VKLLRVLQEQ QLRRIGSNKI IPIDVRILAA TNRDVPSLIR EGKFREDLYY
RLAVVQIKIP MLAERQSDIP SLVTHFLRQF NERNQARVSI EQSAVELLQK RSWPGNVREL
ENTIYRLAIF ASTGRITGAD VEREQESQKN GPKAEPVSAP DRLVEMERHQ ILRILKDVHG
NKSEAARRLG IERKTLYKKA VRLGITLDAS EYQ