Gene Shewmr4_0224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_0224 
Symbol 
ID4250682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp241589 
End bp242578 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content49% 
IMG OID638116776 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_732362 
Protein GI113968569 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00010279 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000313535 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCAGGGTT CTGTTACAGA ATTTCTTAAA CCGCGTCTCG TTGATATCGA GCAGGTTAAC 
TCAACACGTG CCAAGGTTAC TTTGGAACCG CTTGAGCGTG GTTTCGGCCA CACTTTAGGT
AACGCGTTGC GTCGCATCCT ATTGTCGTCT ATGCCCGGCT GCGCGGTTAC CGAAGTCGAG
ATTGACGGCG TGCTGCACGA ATACAGCAGT AAGGAAGGCG TACAAGAAGA TATCCTTGAA
ATCTTGCTGA ACCTGAAAGG GTTAGCAGTG ACTATCGAGG GTAAAGACGA GGCTATGCTT
ACATTAAGCA AGTCCGGCGC AGGCCCTGTC ATCGCAGCAG ATATCACGCA TGATGGTGAT
GTCACTATCG TGAATCCTGA TCATGTTATC TGTCATTTAA CAGGTAACAA TGATATCAGC
ATGCGTATTC GCGTTGAGCG TGGTCGTGGT TATGTGCCAG CATCTGCTCG TGCACAGACT
GAAGACGATG ACCGCCCAAT CGGCCGCTTG CTGGTTGATG CTTCTTTCTC GCCAGTTGCA
CGTATTGCCT ACAATGTAGA AGCAGCACGT GTGGAACAGC GTACTGACTT GGATAAACTC
GTTATCGATA TGACCACTAA CGGTACTATC GATCCTGAGG AAGCTATCCG TCGTTCTGCA
ACCATCTTAG CTGAACAGCT GGATGCGTTC GTTGAGCTGC GCGACGTGAC TGAGCCAGAA
ATGAAAGAAG AGAAGCCGGA GTTCGATCCG ATTCTGTTGC GTCCTGTCGA CGATTTAGAG
CTAACTGTAC GTTCGGCTAA CTGCTTGAAA GCCGAAGCGA TTCATTACAT CGGAGATCTG
GTACAGCGTA CTGAAGTTGA GCTGCTGAAG ACCCCTAACT TAGGTAAGAA ATCTCTTACT
GAAATTAAGG ACGTTTTAGC TTCTCGCGGA CTGTCGTTAG GTATGCGTCT GGAAAACTGG
CCTCCAGCTA GTTTAGCAGA CGACCTATAA
 
Protein sequence
MQGSVTEFLK PRLVDIEQVN STRAKVTLEP LERGFGHTLG NALRRILLSS MPGCAVTEVE 
IDGVLHEYSS KEGVQEDILE ILLNLKGLAV TIEGKDEAML TLSKSGAGPV IAADITHDGD
VTIVNPDHVI CHLTGNNDIS MRIRVERGRG YVPASARAQT EDDDRPIGRL LVDASFSPVA
RIAYNVEAAR VEQRTDLDKL VIDMTTNGTI DPEEAIRRSA TILAEQLDAF VELRDVTEPE
MKEEKPEFDP ILLRPVDDLE LTVRSANCLK AEAIHYIGDL VQRTEVELLK TPNLGKKSLT
EIKDVLASRG LSLGMRLENW PPASLADDL