Gene Sbal223_4031 details


Gene Information

Locus tag: Sbal223_4031
Symbol:
ID: 7086250
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 4799559
End bp: 4800548
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 48%
IMG OID: 643462909
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_002359927
Protein GI: 217975176
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
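
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; neither convention is stated in the record, but both reproduce the listed values. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4799559, 4800548)
print(length)                     # 990
print(protein_length_aa(length))  # 329
```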


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00391144
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000000899958
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGGGTT CTGTTACAGA ATTTCTTAAA CCGCGTCTCG TTGATATCGA GCAGGTTAAC 
TCAACACGTG CCAAGGTTAC ATTGGAACCA CTTGAGCGTG GTTTCGGCCA CACTTTAGGT
AACGCGTTGC GTCGCATCCT ATTGTCGTCT ATGCCCGGCT GCGCGGTTAC CGAAGTCGAG
ATTGACGGTG TACTGCACGA ATACAGCAGT AAGGAAGGCG TTCAAGAAGA TATCCTTGAG
ATCTTGTTAA ACCTGAAAGG GTTAGCAGTG ACTATCGAGG GTAAAGACGA GGCTATGCTT
ACGTTGAGCA AGTCCGGCGC AGGCCCTGTC ATCGCAGCAG ATATCACGCA TGATGGTGAT
GTCACTATCG TGAATCCTGA TCATATTATC TGTCACCTGA CAGGTAACAA TGATATCAGC
ATGCGTATTC GCGTTGAGCG TGGTCGTGGC TATGTACCAG CATCTGCTCG TGCACAGACT
GAAGACGATG ATCGCCCAAT CGGCCGCTTG CTGGTTGATG CTTCTTTCTC GCCAGTTGCA
CGTATTGCCT ACAATGTAGA AGCAGCACGT GTAGAACAGC GTACTGACTT AGATAAACTC
GTTATCGATA TGACCACAAA CGGTACTATC GATCCTGAGG AAGCTATCCG TCGTTCTGCA
ACTATTCTGG CTGAACAGCT AGATGCGTTT GTTGAATTAC GTGACGTGAC TGAGCCAGAG
CTGAAAGAAG AGAAACCGGA ATTCGATCCG ATTCTGCTGC GTCCTGTCGA CGATTTAGAG
CTAACTGTAC GTTCGGCTAA CTGCTTGAAA GCCGAAGCGA TTCATTACAT CGGAGATCTG
GTACAGCGCA CTGAAGTTGA GTTGCTGAAG ACCCCTAACT TAGGTAAGAA ATCTCTTACT
GAAATTAAGG ACGTTTTAGC TTCTCGCGGA CTGTCGTTAG GTATGCGTTT GGAAAATTGG
CCTCCAGCCA GTTTAGCAGA CGACCTATAA
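
The gene sequence is printed in blocks of 10 bases, so the spacing has to be stripped before any computation. The following is a minimal sketch of how a GC fraction such as the 48% reported for this gene could be recomputed from the listing; the helper names are hypothetical, and the rounding used by the source database is not known.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the listing above, pasted as-is with its spacing.
first_line = "ATGCAGGGTT CTGTTACAGA ATTTCTTAAA CCGCGTCTCG TTGATATCGA GCAGGTTAAC"
print(len(clean_sequence(first_line)))  # 60
print(f"{gc_content(first_line):.1%}")
```

Pasting the full listing into `gc_content` gives the value for the whole gene, which can then be compared with the GC content field in the record.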
 
Protein sequence
MQGSVTEFLK PRLVDIEQVN STRAKVTLEP LERGFGHTLG NALRRILLSS MPGCAVTEVE 
IDGVLHEYSS KEGVQEDILE ILLNLKGLAV TIEGKDEAML TLSKSGAGPV IAADITHDGD
VTIVNPDHII CHLTGNNDIS MRIRVERGRG YVPASARAQT EDDDRPIGRL LVDASFSPVA
RIAYNVEAAR VEQRTDLDKL VIDMTTNGTI DPEEAIRRSA TILAEQLDAF VELRDVTEPE
LKEEKPEFDP ILLRPVDDLE LTVRSANCLK AEAIHYIGDL VQRTEVELLK TPNLGKKSLT
EIKDVLASRG LSLGMRLENW PPASLADDL
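
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below shows that translation using only the standard library; the function name is hypothetical, and the codon table is built from the NCBI amino acid string for table 11 in T, C, A, G base order. It translates internal codons only and does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for table 11, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(raw: str) -> str:
    """Translate a CDS, stopping at the first stop codon (stop not included)."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence, with the listing's spacing.
print(translate_cds("ATGCAGGGTT CTGTTACAGA ATTTCTTAAA"))  # MQGSVTEFLK
print(translate_cds("CCTCCAGCCA GTTTAGCAGA CGACCTATAA"))  # PPASLADDL
```

Both fragments match the start and end of the protein sequence listed above.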