Gene Acid345_4314 details


Gene Information

Locus tag: Acid345_4314
Symbol:
ID: 4071887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5124565
End bp: 5125842
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 61%
IMG OID: 637986347
Product: RNA polymerase ECF-subfamily sigma factor
Protein accession: YP_593388
Protein GI: 94971340
COG category: [K] Transcription
COG ID: [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
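
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(5124565, 5125842)
print(length)                  # 1278
print(protein_length(length))  # 425
```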


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.241702
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGATCCA CGACATGCAT CGTGGCTGAC ACTCAGCACC AAATCGCGGG GATGGTGGAC 
CATCTGTTCC GCACCAGTGC GGGTCAGATG GTCTCGCACC TTACCCGTGT GCTCGGCCCG
GCCCACCTCG ATCTCGCGGA AGAAGCCGTA CAGGATGCGC TTGTGAAGGC CCTGCAAAGC
TGGCCTTTTG GCGGCGTTCC CAACAATCCC GGCGGCTGGC TCATGCAGGT TGCGCGCAAT
CGCGCGCTCG ACATCGTTCG TCACCGCGGA ATGGCCGCCG AGAAGACCGG AGAAATCGTC
GCAGAACTTA CCCGCGCCAA TCCAACCGGT GACATCGAAG TGCGCGACCA GTTTCGTGAC
GACGAGTTGC GCATGATCTT CCTTTGCTGC CATCCACTGA TTTCGCGCGA TGCCCGCGTT
GCGCTCAGTC TTAAGACGGT CAGCGGTTTC TCGATCGAGG AGATTTCTCG CGCGTTCCTC
GCCGATCCGC CAACCATTGC GCAGCGCCTG GTGCGCGCCA AGCGGCAGAT ACGCGACGCC
AACATCCGCT TCGATCTTCC GCCGCGTAAA GAACTTTCCG AGCGGCTCGA TTCCGTTCTC
GAAGTGATTT ACCTGCTCTT TAACGAAGGC TACACGGCAC ACGCCGGCGA TGACCTGGTG
CGGCAAGACC TTTGCGTGGA AGCCCTACGT CTCGCCATGT TGGTGGCGGC ATCGCCGGTA
TCGCAACCGC GCGCCAACGC GCTGGTCTCG TTGCTGGCGT TCCAGGCCGC TCGCCTTCCC
GCCCGTGTCG ATGACAAAGG CGAGTTGGTT CTTCTCGAGG ACCAGGACCG CAGCAAGTGG
GACCAGAACT TAATCGCATT CGGTTTCCAC GAGATCGTGA AGAGCGCGCA AGGACAGGCC
GTGTCTACGT ATCACATGCA GGCCGCGATC GCATCCATTC ACGCCCAAGC CAAAGACACT
GCCGGCACCG ACTGGCCGAA GATCCTCATC CTTTACGACG ACCTGATGGC ACTGAATCCC
TCAGCGATCA TCGCGCTGAA TCGGGCGATC GCCGTGTGGC GGGTCCACGG CGTTGTGGCG
GCAATGCGCG AAGTGGACCA AATCGCCCAC GAACCAGCAC TCGCCCACTA CTATCTTCTT
CCCGCCACTC GCGGACGCCT CCTCCTCGAA ATCGGCGACC GTACGGCAGC AGCCGAGTGC
TTCAGCGAAG CACTCAATCG CAAGTGCTCG GAACCGGAGC GCCGGTTCTT ACTGCGACAG
CTGAAGGAAT GCGAATAG
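
The GC content reported for this gene can in principle be recomputed from the sequence as listed. The following is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base spacing and line breaks shown above; `gc_content` is a hypothetical helper name, and the demo input is only the first 10-base block, not the whole gene.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for layout."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, used only as a small demo input.
print(gc_content("GTGAGATCCA"))  # 50.0
```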
 
Protein sequence
MRSTTCIVAD TQHQIAGMVD HLFRTSAGQM VSHLTRVLGP AHLDLAEEAV QDALVKALQS 
WPFGGVPNNP GGWLMQVARN RALDIVRHRG MAAEKTGEIV AELTRANPTG DIEVRDQFRD
DELRMIFLCC HPLISRDARV ALSLKTVSGF SIEEISRAFL ADPPTIAQRL VRAKRQIRDA
NIRFDLPPRK ELSERLDSVL EVIYLLFNEG YTAHAGDDLV RQDLCVEALR LAMLVAASPV
SQPRANALVS LLAFQAARLP ARVDDKGELV LLEDQDRSKW DQNLIAFGFH EIVKSAQGQA
VSTYHMQAAI ASIHAQAKDT AGTDWPKILI LYDDLMALNP SAIIALNRAI AVWRVHGVVA
AMREVDQIAH EPALAHYYLL PATRGRLLLE IGDRTAAAEC FSEALNRKCS EPERRFLLRQ
LKECE
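
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, assuming unambiguous A/C/G/T input and treating the first codon as an initiator read as methionine (here the gene begins with GTG). The helper name `translate_cds` is illustrative; table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may serve as starts.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Simplification: the first codon is always read as M, which is an
    assumption that the input starts at a valid initiator codon.
    """
    dna = "".join(sequence.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[dna[index:index + 3]]
        if aa == "*":
            break
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGATCCA CGACATGCAT CGTGGCTGAC"))  # MRSTTCIVAD
```

The output matches the first ten residues of the listed protein sequence. Input containing ambiguity codes such as N would raise a `KeyError` in this sketch.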