Gene Acid345_4296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4296 
Symbol 
ID4071869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5103453 
End bp5105177 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content57% 
IMG OID637986329 
Productsigma 38, RpoS 
Protein accessionYP_593370 
Protein GI94971322 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.939988 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCTCTCG ACGACAAGTA CGACGATATC AAAAAACTGA TTGATACCGG CAAGGAAAAG 
GGTTACCTGA CGTACAGCGA GGTGAATGAT CTCATCCCGC ACGACGTTCA CTCGCCCGAC
GATCTTGATG ATCTCCTGAC GACGATCGGT ACGCAAGGCA TTGACGTTCT CGAAGGTCCG
GGAAAACTTC CCTCCGCCGC AGTTCTCGAC AAGCGCTATG ACGATGTCGA GGCCGGCGAA
GAGGAGATGG AACTCGACCT CACTCCGGGA GCGCTCGAAA AGACGAATGA CCCCGTGCGC
ATGTATTTGC GCGAAATGGG CACGGTACCG CTGCTGACAC GTGAAGGCGA AGTTGAAATC
GCCAAGCGTA TCGAGCGCGG ACAACTTCGC GTATTAAAAG CAATCTCGCG TTCTCCCATC
GTCATTCGCG ACATCATCGC GATTGGCGAA GACCTGAAGC GCGGCGTGCG CAGCATCAAG
GAAATCGTGA TCTTCGATGA AGAAGAAATC ACGGATGAAG TCCTTGCTGC TCGCCTGAAA
GACACCACCG GGCGCATCGA TGAGCTGAAC AAGCACTACA AGAAGAGTTC CCAGCTCGAG
CAGAAGCTTG AAGAGATTGC TCCCGGCGGC GTTAAGGAAA TCAAGGACAA GAAAAAAGCA
CGCGACGTGC GCAAGGTCCG TTGGACGCTG GGCCGCGAAT TGGTCTCGAT CTCGCGCATC
ATTCGCAAGA TCAACTTCAC CAACGTCGAA CGCAAGCGCT TGATCGATCG CGTGAGCAAG
ACGGTCGAGA ACCTGCGCAT TCTCGAGCGC CAGGTTTCGC ATCTCGAGCA TCGTGCCAAC
GAAACGCGTT CGGAAGAGAC GAAGAAAGAG CTCAAGAAGC AGAGCCGCAC CCTTAAGGGC
GACCTGGAGC GCATGGAGCA GGAGGCTGGC GTTTCCATCG CCGAGCTGAA GCGTACCCAG
CGCGAAATTA TCCAGGGAGA CATGGATGCC GAGCAGGCGA AGAAGGAGCT CATCGAAGCT
AACCTTCGAC TCGTCGTCTC GATCGCGAAG AAGTACACCA ACCGCGGACT CCAGTTCCTC
GACCTCATCC AGGAAGGCAA CATCGGCCTG ATGAAAGCCG TGGACAAGTT CGAGTACCGC
CGTGGCTACA AGTTCTCAAC GTACGCCACG TGGTGGATTC GCCAGGCCAT TACACGCGCG
ATTGCCGATC AGGCCCGCAC CATCCGTATT CCGGTGCACA TGATCGAAAC CATCAACAAG
CTCATCCGCA CCTCGCGTCA ACTGGTGCAG GAACTTGGGC GTGAACCGAG CAGCGAAGAA
ATCGCCAAGC GGATGGATAT CCCCGTGGCG AAGGTCCGCA AAGTGCTGAA GATCGCACAG
GAACCGATCT CGCTCGAAAC ACCGATCGGC GAAGAGGAAG ATTCACACCT TGGCGATTTC
ATCGAGGACC GCTCGATGGT TTCGCCGGCC GAGGCCGTCA TCAACGTGAA CCTCAAGGAC
CAGACAGCCC AGGTCCTGCG CACGCTCACC GCGCGCGAAG AAAAGGTCAT CAAGATGCGG
TTCGGACTCG AAGACGGTTC AGAGCACACG CTCGAGGAAG TCGGCCAGTC GTTCGCCGTT
ACACGCGAAC GCATCCGCCA AATCGAGGCG AAGGCGTTGC GCAAGCTGCG TCATCCGTCA
CGCTCGCGGA AGCTGCGGGC ATTTCTCGAT GGAGTGCGCG ACTAG
 
Protein sequence
MALDDKYDDI KKLIDTGKEK GYLTYSEVND LIPHDVHSPD DLDDLLTTIG TQGIDVLEGP 
GKLPSAAVLD KRYDDVEAGE EEMELDLTPG ALEKTNDPVR MYLREMGTVP LLTREGEVEI
AKRIERGQLR VLKAISRSPI VIRDIIAIGE DLKRGVRSIK EIVIFDEEEI TDEVLAARLK
DTTGRIDELN KHYKKSSQLE QKLEEIAPGG VKEIKDKKKA RDVRKVRWTL GRELVSISRI
IRKINFTNVE RKRLIDRVSK TVENLRILER QVSHLEHRAN ETRSEETKKE LKKQSRTLKG
DLERMEQEAG VSIAELKRTQ REIIQGDMDA EQAKKELIEA NLRLVVSIAK KYTNRGLQFL
DLIQEGNIGL MKAVDKFEYR RGYKFSTYAT WWIRQAITRA IADQARTIRI PVHMIETINK
LIRTSRQLVQ ELGREPSSEE IAKRMDIPVA KVRKVLKIAQ EPISLETPIG EEEDSHLGDF
IEDRSMVSPA EAVINVNLKD QTAQVLRTLT AREEKVIKMR FGLEDGSEHT LEEVGQSFAV
TRERIRQIEA KALRKLRHPS RSRKLRAFLD GVRD