Gene Cwoe_4683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4683 
Symbol 
ID8735149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4990073 
End bp4991176 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content67% 
IMG OID646505312 
ProductRNA polymerase, sigma 70 subunit, RpoD subfamily 
Protein accessionYP_003396471 
Protein GI284046131 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.256379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.534178 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTAG CTGAACTTCA GGAACTCGAA GAGATCAAGG GCCTCGTCAA CCGCGGCACG 
CAGCTCGGCG TCCTGACGTA CGCCGAGATC GCGAGAGCGG TCAGCGAGCT CGATCTCGAC
GAGTCGGACG TCGAGGACCT GCACGGCTTC CTCGAGAGAG CCGAGATCGA GCTCGTCGAG
GAGATCGATC CGGCCACGAC GGCGAGCAAC CAGGTCGAAC GCGCGCCCGA CAGACGTCGC
GGCCGCCGCG CGAGAACCGC GCTCGACCTC AAGCCCGACA TGACGACCGA TTCCCTTCAG
CTGTTCCTGA AGGACATCGG CAAGGTGCGG CTGCTCACCG CCCAGGAGGA GGTCGACCTC
GCGAAGCGGA TCGAGCGCGG CGACCTCGAC GCGAAGCAGA AGATGGTCGA GTCGAACCTT
CGCCTCGTCG TCTCGATCGC GAAGAACTAC CGCAACCAGG GCCTGCCGTT CCTCGATCTG
ATCCAGGAGG GCACGCTCGG CCTCGTGCGC GCCGCGGAGA AGTTCGACTA CCGCAAGGGC
TTCAAGTTCT CGACCTACGC GACCTGGTGG ATCCGCCAGG CGATCGCGCG TGCGCTCGCC
GACAAGGCGC GCACGATCCG CATCCCGGTC CACGTCGTCG AGAAGCTGAA CAAGATCGGC
CGTGCCGAGC GCAAGCTCGT CACGGAGTTG GGCCGCGAGC CCACCGCCGA GGAGATCGCC
GACGTGACGG GGATCGACCC GGAGGAGGTC GACTCGATCA AGCGCTCCGC GCAGGCGCCG
GTCTCGCTGG AGAAGCCGGT CGGCGACGAG GAGGAGTCCG AGTTCGGCCA GTTCATCGCC
GACGAGCGCG CGGAGTCTCC CTACGAGCGG GCTGCCGAGA TCCTCACGAA GGAAGCCCTT
CGCGAGGCGC TCGAGAACCT CTCCTACCGC GAGCGCCGCG TGCTGGAGTT GCGCTACGGC
CTCGGCGGCG AGCATCCGCG CACGCTCGAC GAGGTCGGCC GCACGTTCAA CGTCACGCGC
GAGCGGATCC GCCAGATCGA GAACCAGTCG CTCAAGAAGC TGCAGTCGCT CGCGGAGGCG
CAGAAGCTCC GCGACGTCGC GTAG
 
Protein sequence
MSVAELQELE EIKGLVNRGT QLGVLTYAEI ARAVSELDLD ESDVEDLHGF LERAEIELVE 
EIDPATTASN QVERAPDRRR GRRARTALDL KPDMTTDSLQ LFLKDIGKVR LLTAQEEVDL
AKRIERGDLD AKQKMVESNL RLVVSIAKNY RNQGLPFLDL IQEGTLGLVR AAEKFDYRKG
FKFSTYATWW IRQAIARALA DKARTIRIPV HVVEKLNKIG RAERKLVTEL GREPTAEEIA
DVTGIDPEEV DSIKRSAQAP VSLEKPVGDE EESEFGQFIA DERAESPYER AAEILTKEAL
REALENLSYR ERRVLELRYG LGGEHPRTLD EVGRTFNVTR ERIRQIENQS LKKLQSLAEA
QKLRDVA