Gene Acid345_0600 details

Gene Information

Locus tag: Acid345_0600
Symbol:
ID: 4069633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 730289
End bp: 731335
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 59%
IMG OID: 637982605
Product: ECF subfamily RNA polymerase sigma-24 factor
Protein accession: YP_589679
Protein GI: 94967631
COG category: [K] Transcription
COG ID: [COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog
TIGRFAM ID: [TIGR00741] ribosomal subunit interface protein
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
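
The coordinates and lengths in this record are mutually consistent. The sketch below reproduces that arithmetic under two assumptions the record does not state explicitly: that Start bp and End bp are 1-based inclusive positions, and that the final codon of the CDS is a stop codon not counted in the protein length. Variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 730289
end_bp = 731335

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1047
print(protein_length)  # 348
```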


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAACGTCC ATATCAGTTA CAAGGGTCTG GAAAAGACCC CCGACGTCGA ATCCACCGTC 
AAACTACACC TGAAAAAACT GGAGCGCCGA CTGCAGGTAT TCCGGCCCGA GCTGGTCAGC
CTGCATGGCA GTGTCCTGCA AAAGTCGCCA CGAGCGGGGT TTGTGGTCGC CTTGAACCTC
AAGCTCCCGA CGACCGACAT TGCGGCGGAA CAGGCGAACC CAAATTCAGC AGTCGCGGTG
AAGGCCGCGT TCGATTCCCT CATCGGCCAG ATCTCGCGCC ACAAAGGTGC GCTGCGCAAC
GAGCACGCCT GGCCGCGACG ACAACAGGAA CATGGTCGCA ATGCGATCAG CGAAGTTCCC
TTTGAAGACA CGGTCGCAGC GATCAAGCCA GAGAGCGTGA CCAACGAAGA TGTGAGCAGC
TACATCAACG CCAACCTGCC TCGACTGCGC CGGTTCGTGC AGCGCGAACT GCGGCATCGC
GAACAAGATG AGAAGATCGC GCGCGGCTCG ATCTCGGTTG ACGAAGTGAT TGACGAAGCC
ATCGGCAACG CGTTGAGTGA GGCCTTTGAG CGGCCCGAGA AGATGCGCCT CGAGCCATGG
CTCTATCGGC TGTCCACCGA TGCGATCGAT CGGCTCGCAG CCGGCGATTC CGGGGGCGGA
AATATTCCAC TGGACCGGCC CGACCGTTCG CGCGATGGCG AAGGCAGCGA CGAAAACGTG
CTCCAGTTCC ACCAACCCGA CGAAGATCTG TCGGCAATGA GCCTGACGTT CGACAAGAAC
ATTTCCACGC CCGAAGACCT CGCGGCGAAG GACGAGATGA TTTCGCTCGT CGAACGCACG
TTGCGCGATG CTGGGAGGAA CGAACGCGAG GCTTTCATTC TTTTCACCAT CGAGGGCTTC
ACCGTCGAAG AGATTGCAGA CATCACCCAG AAACCGGAAG AGGAGGTAAG AAAGAGCGTG
CATTCCGCCC GTGAGTACTT GAAAGAGTTT TTGCCCGTGC GGGATCCGCT GAGTGACCGT
TTGATCGAAC ACTCCAAAAC GGCCTAA
 
Protein sequence
MNVHISYKGL EKTPDVESTV KLHLKKLERR LQVFRPELVS LHGSVLQKSP RAGFVVALNL 
KLPTTDIAAE QANPNSAVAV KAAFDSLIGQ ISRHKGALRN EHAWPRRQQE HGRNAISEVP
FEDTVAAIKP ESVTNEDVSS YINANLPRLR RFVQRELRHR EQDEKIARGS ISVDEVIDEA
IGNALSEAFE RPEKMRLEPW LYRLSTDAID RLAAGDSGGG NIPLDRPDRS RDGEGSDENV
LQFHQPDEDL SAMSLTFDKN ISTPEDLAAK DEMISLVERT LRDAGRNERE AFILFTIEGF
TVEEIADITQ KPEEEVRKSV HSAREYLKEF LPVRDPLSDR LIEHSKTA
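
As a quick consistency check, the first and last codons of the gene sequence can be translated and compared with the two ends of the protein sequence. The sketch below builds the codon table by hand rather than using a bioinformatics library. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The function names `translate` and `gc_fraction` are illustrative, and only short excerpts of the gene sequence are used.

```python
from itertools import product

# Codon-to-amino-acid mapping of the standard code, which translation
# table 11 shares. Codons are enumerated in TCAG order, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence as listed in this record.
first_bases = "ATGAACGTCC ATATCAGTTA CAAGGGTCTG"
# Final row of the gene sequence; the 1020 bases before it are a
# multiple of 3, so this row starts in frame.
last_row = "TTGATCGAAC ACTCCAAAAC GGCCTAA"

print(translate(first_bases))  # MNVHISYKGL
print(translate(last_row))     # LIEHSKTA
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.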