Gene Spro_4300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4300 
Symbol 
ID5604408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4766903 
End bp4768846 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content54% 
IMG OID640939860 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_001480522 
Protein GI157372533 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCAC AGTGGGCCGC GGCAACAACA AAAACGCCCC AAATGCTATT GTTGGCGGTT 
TCGCCGACCG ACACCAACCC CAATACTCTG AAGTGTGGAT ACCGTCTTAT GGAGCAAAAC
CCGCAGTCAC AGCTAAAGCT ACTTGTCACC CGTGGTAAGG AGCAAGGCTA TCTGACCTAT
GCTGAGGTCA ATGACCATCT GCCGGAAGAT ATCGTCGACT CCGACCAGAT CGAAGACATC
ATCCAGATGA TTAACGACAT GGGCATCCAG GTGCTGGAAG AAGCACCGGA CGCCGATGAC
CTGATGCTGG CCGAGAATAC CACCGACACC GATGATGATG CGGCGGAAGC GGCTGCTCAA
GTGTTGTCCA GCGTTGAGTC TGAAATTGGC CGTACCACCG ACCCGGTGCG CATGTACATG
CGTGAAATGG GGACCGTTGA ACTGCTGACG CGCGAAGGCG AAATCGACAT CGCCAAACGC
ATTGAAGACG GCATCAACCA GGTTCAGTGC TCGGTTGCCG AGTACCCGGA AGCCATTACC
TATTTGCTGG AGCAGTACGA TCGCGTCGAA GCGGGCGAAG CTCGCCTGTC CGATCTGATC
ACCGGCTTCG TCGATCCTAA CGCGGAAGAA GACATCGCTC CTACCGCAAC GCACGTCGGT
TCTGAGCTGT CGACCGAAGA GCAGAAAGAT GACGAAGAAG AAGACGACGA AGACGAAGAA
GAAGATGACA ACAGCATCGA TCCAGAGCTG GCTCGTCAGA AATTCGCCGA TCTGCGCGAT
CAGTATGAAG CGACTCGTGT AGTCATCAAA AAGAACGGCC GCAGCCACGC CAGCGCAGCA
GAAGAAATTC TGAAGCTGTC TGAAGTGTTC AAGCAGTTCC GCCTGGTGCC AAAACAGTTC
GACTTCCTGG TCAATAGCAT GCGCATCATG ATGGACCGCG TTCGTACTCA AGAACGTATC
ATCATGAAGC TGTGCGTTGA GCAGTGCAAA ATGCCGAAGA AAAACTTCGT TACCCTGTTC
TCCAGCAACG AAACCAGTGA TACCTGGTTC GCAGCAGCGC TGGCAATGGC CAAGCCATGG
TCAGAAAAGC TCAAAGACGT CGCGGATGAC GTGCAGCGCA GCCTGCAGAA ACTGCGTCAG
ATCGAAGAAG AGACCGGCCT GACCATCGAG CAGGTGAAGG ACATTAACCG TCGCATGTCT
ATCGGTGAAG CGAAAGCCCG CCGTGCGAAG AAAGAGATGG TTGAAGCCAA CTTGCGTCTG
GTTATTTCTA TCGCCAAGAA ATACACCAAC CGCGGTTTGC AGTTCCTGGA TCTGATCCAG
GAAGGCAACA TCGGTTTGAT GAAAGCGGTA GACAAGTTTG AATACCGCCG TGGTTATAAG
TTCTCAACTT ACGCCACCTG GTGGATCCGT CAGGCTATCA CCCGCTCTAT CGCCGACCAG
GCGCGTACCA TCCGTATTCC GGTGCATATG ATTGAGACCA TCAACAAACT CAACCGTATT
TCGCGCCAGA TGCTGCAAGA GATGGGCCGC GAACCGACGC CGGAAGAGCT GGCTGAACGC
ATGCTGATGC CGGAAGACAA AATCCGCAAA GTGCTGAAGA TCGCCAAAGA GCCGATCTCC
ATGGAAACGC CGATCGGTGA TGATGAAGAT TCACACTTGG GCGATTTCAT CGAGGATACC
ACCCTCGAGC TGCCGTTGGA TTCTGCTACT TCGGAGAGCC TGCGCTCTGC CACGCACGAC
GTTCTGGCCG GCCTGACCGC CCGTGAAGCG AAAGTGCTGC GTATGCGTTT CGGTATCGAT
ATGAATACTG ACCACACGCT GGAAGAAGTG GGCAAACAGT TCGACGTTAC CCGTGAGCGT
ATTCGTCAGA TCGAAGCCAA AGCGCTGCGT AAACTGCGTC ACCCAAGCCG TTCCGAAGTG
CTGCGTAGCT TCCTGGACGA CTAA
 
Protein sequence
MPPQWAAATT KTPQMLLLAV SPTDTNPNTL KCGYRLMEQN PQSQLKLLVT RGKEQGYLTY 
AEVNDHLPED IVDSDQIEDI IQMINDMGIQ VLEEAPDADD LMLAENTTDT DDDAAEAAAQ
VLSSVESEIG RTTDPVRMYM REMGTVELLT REGEIDIAKR IEDGINQVQC SVAEYPEAIT
YLLEQYDRVE AGEARLSDLI TGFVDPNAEE DIAPTATHVG SELSTEEQKD DEEEDDEDEE
EDDNSIDPEL ARQKFADLRD QYEATRVVIK KNGRSHASAA EEILKLSEVF KQFRLVPKQF
DFLVNSMRIM MDRVRTQERI IMKLCVEQCK MPKKNFVTLF SSNETSDTWF AAALAMAKPW
SEKLKDVADD VQRSLQKLRQ IEEETGLTIE QVKDINRRMS IGEAKARRAK KEMVEANLRL
VISIAKKYTN RGLQFLDLIQ EGNIGLMKAV DKFEYRRGYK FSTYATWWIR QAITRSIADQ
ARTIRIPVHM IETINKLNRI SRQMLQEMGR EPTPEELAER MLMPEDKIRK VLKIAKEPIS
METPIGDDED SHLGDFIEDT TLELPLDSAT SESLRSATHD VLAGLTAREA KVLRMRFGID
MNTDHTLEEV GKQFDVTRER IRQIEAKALR KLRHPSRSEV LRSFLDD