Gene Acid345_3512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3512 
Symbol 
ID4072771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4149774 
End bp4150661 
Gene Length888 bp 
Protein Length295 aa 
Translation table11 
GC content60% 
IMG OID637985535 
ProductECF subfamily RNA polymerase sigma-24 factor 
Protein accessionYP_592587 
Protein GI94970539 
COG category[K] Transcription 
COG ID[COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.862108 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAGA AAAAATTCCT CGCAGAACGA TTTGAAGAGA ACCGCAGTCA TTTGCGCGCG 
GTGGCGTACC GGATGCTGGG CTCGGCGACT GAGGCCGAGG ATGCTGTGCA GGAAGCATGG
CTGCGCCTGA ATCGCGCGGA CACGCAACAG GTGAATAACC TGAATGCGTG GCTGACGACG
GTGGTGGCGC GGGTATGTCT GGACACGCTG CGCTCGCGGA AAGCGCGGCG CGAGGAAGCG
CTGGATACAG AAGTTACGGA GCCGGTCGCG TCGACTCAAA AAAGAAACAA TCCAGAGCAG
GAAGCGATGC TCGCAGACTC GGTAGGAATC GCGCTGCTGG TGGTGCTGGA CCGGCTGAGT
CCGGCGGAGC GGCTGGCGTT TGTTTTGCAT GATTTGTTCG GGCAGTCGTT CGAGGAAGTT
GCGGCGGTGT TGGACAAGAC GCCGGCTGCG GTGCGTCAGC TTGCGAGCCG CGCGCGTCGC
AGAGTTCAGG GGCCGCCGGC GATTGAACGC AACGTGCTCA ATGCGCATCG GCGAGTGGTG
GATGCGTTTC TTACGGCGCT GCGGGCGAGG GACTTTGAAG GATTGGTCTC GGTGCTCGAT
CCTAATGTCG TGGTGCGCAT TGATAAATTC TCAGCGACTG GTGGGAAGGA CATCGAAATT
CGCGGCGCCG AAACTTGGGC TCGTGGTGCG ATCCAATTCG CAGAAGGCGC ACGCTTGGCG
CGGACGGTGT TGGTGGATGG CGAAGTCGGT GTGGTGATGG CGCCGCGTGG CAAACTATTC
CGGGCTGTGC GTCTCTCGAT TTCGGATGAG GGACGTATTC GCGAGGTAGA GATCATCGGG
GAGAAAGAGC GGCTCGCGGT GATGGATGTA CAGCTACTGC CGGAGTGA
 
Protein sequence
MDEKKFLAER FEENRSHLRA VAYRMLGSAT EAEDAVQEAW LRLNRADTQQ VNNLNAWLTT 
VVARVCLDTL RSRKARREEA LDTEVTEPVA STQKRNNPEQ EAMLADSVGI ALLVVLDRLS
PAERLAFVLH DLFGQSFEEV AAVLDKTPAA VRQLASRARR RVQGPPAIER NVLNAHRRVV
DAFLTALRAR DFEGLVSVLD PNVVVRIDKF SATGGKDIEI RGAETWARGA IQFAEGARLA
RTVLVDGEVG VVMAPRGKLF RAVRLSISDE GRIREVEIIG EKERLAVMDV QLLPE