Gene Smal_3572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_3572 
Symbol 
ID6474452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp4024096 
End bp4025952 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content62% 
IMG OID642732771 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_002029954 
Protein GI194367344 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00998328 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.692055 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAACG AACGTCCTGC CCAATCCGAA ATCAAGCAAC TGATCAGCAA GGGCCTGGAA 
CAGGGCTACC TGACCTACGC CGAAGTCAAT GACCATCTGC CGGACGACAT GGTCGACCCG
GAACAGATCG AAGACATCAT CGGCATGATC AACGGCATGG GCATTGATGT CCATGAAGTT
GCGCCGGATG TGGAAACCCT GCTTCTCAAT GATGGCAACA CCGGTAACCG CGAAGTCGAT
GACACCGCGG CCGAAGAAGC TGCCGCCGCG CTGAGCGCAC TCGACACCGA AGGTGGCCGT
ACCACCGACC CGGTGCGCAT GTACATGCGC GAAATGGGCA CCGTCGAGCT GCTGACCCGC
GAAGGCGAAA TCGCCATCGC CAAGCGCATC GAGGAAGGCC TTTCGCAGGT GCAGGCCGCA
CTGGGCCAGT TCCCGGTGTC GGTCGAATCG CTGTTGAACG ATTACGAAGC CCACAAGGAA
GGCAAGAAGC GCCTGGCCGA AGTGATCGTC GGCTTCAACG ACCTGGCCGA AGAAGTTGCC
GCGCCGGCCG CGCCCGCTGC CTCCGATGAC GGTGACGACG CAGGCGCCGA CGCTGACGAG
GAAGAGGACG ATGATGTCGA CGGCGGCGAT GAAGAAGCCG CACCGACCGG TCCGGACCCG
GAGGAAGTCG CTGCGCGCAT GCAGGCGCTG AGCGATGCCT TCAATGCCTT CAAGAAGGCT
GTGGCCAAGG GCGACAAGAA GAGCCTGCTG AAGCTCCGCG AGGAGATGTC GGCGGTGTTC
GTCACCCTGA AGCTGCCGCT GCCGCTGACC GACGTGCTGA CCAAGCAGCT GCGCGACACC
ATGGCCGGCA TCAAGTCCCA TGAGCGCCGC GTGCTGAACC TGGCAACCGT CACCGCGCGC
ATGCCGCGCA AGGATTTCAT CCGCTCCTGG GAAGGCAACC AGACCAACCT GGAGTGGGTG
GAAGACGCAC TGAAGCGCAA GCAGAAGTGG TCTTCGGCCC TGCGTGAAGT GAAGGACCAG
ATCATCGCCG AACAGCAGGC GACGATCGAC ATCGAGAAGC TGACCCAGCT GGACCTGGAC
GATCTGAAGG AACTCAGCCG TGCCATGGCC TACGGCGAAG CCAAGGCGCG CAAGGCCAAG
AAGGAAATGG TCGAGGCCAA CCTGCGCCTG GTGATCTCGA TCGCCAAGAA GTACACCAAC
CGCGGCCTGC AGTTCCTCGA CCTGATCCAG GAAGGCAACA TCGGCCTGAT GAAGGCCGTG
GACAAGTTCG AATACCGTCG TGGCTACAAG TTCTCCACGT ATGCGACCTG GTGGATCCGT
CAGGCCATCA CCCGTTCGAT CGCCGATCAG GCGCGCACCA TCCGTATCCC GGTGCACATG
ATCGAAACGA TCAACAAGTT GAACCGCATT TCCCGCCAGA TGCTCCAGCA GTACGGCCGC
GAGGCTACGC CGGAGGAGCT GGCCAAGGAA ATGGACATGC CGGAAGACAA GATCCGCAAG
GTGATGAAGA TCGCCAAGGA GCCGATCTCG ATGGAAACCC CGATCGGCGA CGACGAGGAT
TCCCATCTGG GCGACTTCAT CGAGGACACC AACGTGGAGT CCCCGATCGA GAACACCACC
AACATCAACC TGTCTGAAAC CGTGCGCGAC GTGCTGGCCG GCCTCACCCC GCGTGAAGCC
AAGGTGCTGC GCATGCGCTT CGGCATCGAC ATGAACACCG ACCACACCCT CGAGGAAGTC
GGCAAGCAGT TCGACGTGAC TCGCGAGCGC ATCCGCCAGA TCGAAGCGAA GGCCCTGCGC
AAGCTGCGTC ACCCGAGCCG CTCGGAGCAG CTGCGCAGCT TCCTGGATAT CGACTGA
 
Protein sequence
MANERPAQSE IKQLISKGLE QGYLTYAEVN DHLPDDMVDP EQIEDIIGMI NGMGIDVHEV 
APDVETLLLN DGNTGNREVD DTAAEEAAAA LSALDTEGGR TTDPVRMYMR EMGTVELLTR
EGEIAIAKRI EEGLSQVQAA LGQFPVSVES LLNDYEAHKE GKKRLAEVIV GFNDLAEEVA
APAAPAASDD GDDAGADADE EEDDDVDGGD EEAAPTGPDP EEVAARMQAL SDAFNAFKKA
VAKGDKKSLL KLREEMSAVF VTLKLPLPLT DVLTKQLRDT MAGIKSHERR VLNLATVTAR
MPRKDFIRSW EGNQTNLEWV EDALKRKQKW SSALREVKDQ IIAEQQATID IEKLTQLDLD
DLKELSRAMA YGEAKARKAK KEMVEANLRL VISIAKKYTN RGLQFLDLIQ EGNIGLMKAV
DKFEYRRGYK FSTYATWWIR QAITRSIADQ ARTIRIPVHM IETINKLNRI SRQMLQQYGR
EATPEELAKE MDMPEDKIRK VMKIAKEPIS METPIGDDED SHLGDFIEDT NVESPIENTT
NINLSETVRD VLAGLTPREA KVLRMRFGID MNTDHTLEEV GKQFDVTRER IRQIEAKALR
KLRHPSRSEQ LRSFLDID