Gene Psyc_0203 details

Gene Information

Locus tag: Psyc_0203
Symbol: rpoN
ID: 3514539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Psychrobacter arcticus 273-4
Kingdom: Bacteria
Replicon accession: NC_007204
Strand:
Start bp: 242863
End bp: 244572
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 46%
IMG OID: 637668890
Product: RNA polymerase sigma-54 factor family protein
Protein accession: YP_263510
Protein GI: 71064783
COG category: [K] Transcription
COG ID: [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
TIGRFAM ID: [TIGR02395] RNA polymerase sigma-54 factor
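
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `expected_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
from typing import Tuple


def expected_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


gene_len, protein_len = expected_lengths(242863, 244572)
print(gene_len, protein_len)  # 1710 569
```

Under these assumptions the coordinates reproduce the listed 1710 bp and 569 aa.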


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000389153
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker


Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000144237
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker


Sequence

Gene sequence
ATGAGTATGT CGTTCGGATT GGCGACCACG CTAAGCATCT CACAAAAACT GACCCCGCAA 
ATGCAGCAAG CTATTAAATT GCTGCAGCTC TCTAGTCTTG AGCTTGCGCA GGAGGTTCAA
GCGAAGCTGG ATAGTAATCC ACTACTAGAA CGCATCGAAG ACGATGAAGA TGAGTATGAA
AGTGCCGATA AAAGTGCCGA TAAAGACGCG TTAGTTGAGT TTGGCGAGCC ACTAACGCTA
GACAGCTGGA ATCAAAACAC GGCTTCTGAC ACATTTTCCG TATCTTCTTC AAGTAATAAT
GAGGATGAGT TTGAGTATGA TGACAGTATT AGCGATAGCC TAGATAAATT GCAGCAAGAC
AGTTTTGATA CGGATGCCAT AGACAGTTAT ATGCTTGAGG ACTTCGACGG TTCAAATGAA
GAAAGCCTCA ACTATGATAG CCCTTATGCA GAATATGATG GTTTTGACAC AGCCAGTGCT
GGTACAGGCA GCCCGTCAGC GCGTCCAGAT TCTGATGACT TTGATAGCTA TCAAGGCAGT
ACCAATGCGA CGATTCAAGA CCATGTGCGC TGGCAGCTAA ATTTTAAGCG ACTCTCAGAG
ACAGATACGC TTATCGCTGA GTATCTGATG GACTCTATGG ACGATATGGG TTTTGTACGG
CTTGATATCG AGGAGTTGCT ACAAAGCTTT GATACCATTG CGAGTTTTTA TCAGTGGGAT
GAGCGTGTTG AACATGACGA AGTGATGGCC GTACTGCGTA TGATTCAATC CTGTGATCCG
CTGGGGGTTG GTGCGCGAAA TCTAAGTGAA TGCCTAGCGA TTCAGCTATC TAAGCTCGAT
ACTGACATAC CGTATCTCAA ACAAGCCCAA GCGCTCTTGT CAGCCAGTGA GCATCTGGTC
AGTAATAACA TTAAAGCCTT AACTGAGCGC ACAGGACTTG CTGCACAGGA GATTACCCCC
GCACTTATGC TGTTACGCAG TTTGAATCCT TCACCAGGAT TGCTGTTTCA GAGTCGCCAA
CCTGATTATA CTCAGTCGCC CGACAGTTAT GATATCCCTG ATGTACTGGT GACGCCTATC
CGTCGCCATG ATGCCAATCA AAACACCGAC ACATCTACCC AAGAGGATGG CTGGCAGGTA
CGTCTAAACC CTGAAACTTT GCCAAAACTA CGGGTCAATC AAGAATATGC CAACCTTGTT
AAGCGCGGTG ATAATAGCCC TGATAATCAA TATCTGCGTG AAAATCTCAC CGATGCGCGC
TTGTTTATTC GCAGTATCGA AGAGCGTAAT CAGAACCTTT TGAAGGTGGC GACCAGTATC
GTCCGCTACC AGCAAGCGTT TTTGCAGTAC GGTGCCACTG CCATGCAACC GCTTATTTTA
AAGGTAATTG CAGACGAAGT CGATTTACAC GAATCAACGG TCTCGCGCCT GACTACCAGT
AAGACCATTT TGACCCCACA AGGTCTGTTT TCGCTCAAGC ACTTCTTTTC ATCACATGTT
AGCAGCAGCG ATGGCGATAT CTCATCAATT GCTATCAGCG CGATGATTAA GCAGCTTATA
GCCGATGAGG AGCCCAAAAA GCCACTATCC GACAGTAAGA TAAAAAATTA TTTGTTAGCA
GAAGGTATCG ATATTGCCAG AAGAACAGTT GCCAAATATC GTGAAGCGAT GAGTATCGGC
TCATCTACTC AGCGCAAACA AAAATATTAA
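
A GC content figure such as the one in the Gene Information table can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as printed above (blocks of ten bases separated by spaces and line breaks); the helper name `gc_fraction` is hypothetical. Only the first 60 bases are used here as a worked example.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string.

    Whitespace (spaces, newlines) used for display is removed first.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First line of the gene sequence, copied as printed
first_line = "ATGAGTATGT CGTTCGGATT GGCGACCACG CTAAGCATCT CACAAAAACT GACCCCGCAA"
print(gc_fraction(first_line))  # 0.5 (30 of these 60 bases are G or C)
```

Passing the full 1710 bp sequence to the same function gives a value that can be compared with the listed percentage.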
 
Protein sequence
MSMSFGLATT LSISQKLTPQ MQQAIKLLQL SSLELAQEVQ AKLDSNPLLE RIEDDEDEYE 
SADKSADKDA LVEFGEPLTL DSWNQNTASD TFSVSSSSNN EDEFEYDDSI SDSLDKLQQD
SFDTDAIDSY MLEDFDGSNE ESLNYDSPYA EYDGFDTASA GTGSPSARPD SDDFDSYQGS
TNATIQDHVR WQLNFKRLSE TDTLIAEYLM DSMDDMGFVR LDIEELLQSF DTIASFYQWD
ERVEHDEVMA VLRMIQSCDP LGVGARNLSE CLAIQLSKLD TDIPYLKQAQ ALLSASEHLV
SNNIKALTER TGLAAQEITP ALMLLRSLNP SPGLLFQSRQ PDYTQSPDSY DIPDVLVTPI
RRHDANQNTD TSTQEDGWQV RLNPETLPKL RVNQEYANLV KRGDNSPDNQ YLRENLTDAR
LFIRSIEERN QNLLKVATSI VRYQQAFLQY GATAMQPLIL KVIADEVDLH ESTVSRLTTS
KTILTPQGLF SLKHFFSSHV SSSDGDISSI AISAMIKQLI ADEEPKKPLS DSKIKNYLLA
EGIDIARRTV AKYREAMSIG SSTQRKQKY
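
The protein sequence corresponds to the gene sequence translated with translation table 11. As a hedged illustration, the sketch below builds a codon table and translates the first and last lines of the gene sequence. Table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; because this gene starts with ATG, that difference is assumed not to matter here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...)
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon, ignoring display whitespace."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAGTATGT CGTTCGGATT GGCGACCACG CTAAGCATCT CACAAAAACT GACCCCGCAA"
last_line = "TCATCTACTC AGCGCAAACA AAAATATTAA"
print(translate(first_line))  # MSMSFGLATTLSISQKLTPQ
print(translate(last_line))   # SSTQRKQKY
```

Both outputs match the start and end of the protein sequence listed above. The last line can be translated on its own only because the preceding 1680 bases are a multiple of three, so the reading frame is preserved.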