Gene Cwoe_1366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_1366 
Symbol 
ID8731805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp1432171 
End bp1433172 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content68% 
IMG OID646501984 
ProductDNA-directed RNA polymerase, alpha subunit 
Protein accessionYP_003393170 
Protein GI284042830 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGACT TTCAAGTCCC CCGAATCACC ACTGAGACCG TTGAGGAGAA CCGCGGCTCC 
TTCACGATCG AGCCGCTCGA TCGCGGGTTC GGCTACACGT TCGGCAACTC GCTTCGCCGT
GTCCTGCTCT CTTCGCTCGC TGGCGCCGCG GTCACGAGCG TGCGGATCGA GGGCGTCGCG
CACGAGTTCT CCACGATTCC GGGTGTGAAG GAGGACGTGA CCGACATCGT CCTCAACCTC
AAGGGCATCG TCTGCCGCAT GCACTCCGAC GCGACCGAGA TCGAGGCCCC GCTCGTCGTC
ACGGGCCCCG GCGACATCAC CGCGAAGGAC ATCGACCTGC CCTCCGGCGT CGAGATCCTG
AACCCGGACG CGCACATCGC GACGCTCGAG AAGAAGACGA AGCTCGAGGT GTACCTGACG
ATCGGCCGCG GCCGCGGCTA CCGCCCGGCC GAGGAGAACA AGTCGCCCGA TCAGCCGATC
GGCGTGATCC CGATCGACTC GATCTTCTCG CCGGTCCGTC GCGTCGCCTA CGCGGTCGAG
CAGGCCCGCG TCGGCCAGAA GACCGACTAC GACAAGCTGA CGCTCGACAT CGAGACCGAC
GGCTCGATCG ACCCGCACGC CGCGCTGCGC GAGGCGGCCG AGATCCTGAT CTCGCAGCTG
GCGATCTTCA CCGACGCTGA CCGCGTCATC GAACTGCGCG ACACCGGGGG CCTCGGCGCG
CTCGAGCCCG GCCTCGCCGG TGGCGGCGTC GGCGGCGTGG GCGGGCACGG TGCCGGCCCC
GGCGGCCGTC CCGCGAACGC GATGGACGAC ATCCTGATCG AGGAGCTGGA GCTGGGCGTG
CGCTCGTACA ACTGCCTCAA GCGCGCGGGC ATCCAGACCG TCGGCGACCT CATCTCCAAG
ACCGAGAACG AGCTGAACGC GATCCCGAAC TTCGGCAAGA AGTCGATCGA CGAGGTCATC
GAGACGCTCG AAGGGCGCGG GCTCCACCTG CGCCAGGACT AA
 
Protein sequence
MLDFQVPRIT TETVEENRGS FTIEPLDRGF GYTFGNSLRR VLLSSLAGAA VTSVRIEGVA 
HEFSTIPGVK EDVTDIVLNL KGIVCRMHSD ATEIEAPLVV TGPGDITAKD IDLPSGVEIL
NPDAHIATLE KKTKLEVYLT IGRGRGYRPA EENKSPDQPI GVIPIDSIFS PVRRVAYAVE
QARVGQKTDY DKLTLDIETD GSIDPHAALR EAAEILISQL AIFTDADRVI ELRDTGGLGA
LEPGLAGGGV GGVGGHGAGP GGRPANAMDD ILIEELELGV RSYNCLKRAG IQTVGDLISK
TENELNAIPN FGKKSIDEVI ETLEGRGLHL RQD