Gene Smed_0004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0004 
Symbol 
ID5320831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3456 
End bp4739 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content66% 
IMG OID640788935 
ProductRNA polymerase ECF-subfamily sigma factor 
Protein accessionYP_001325699 
Protein GI150395232 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.122751 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000113113 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGACA CAGCCTGGAT CGATCTCGCA CTGGTTTCCG CACGGCCACA GGCCATGGGT 
GCGTTGCTGC GATACTTTCG CAGCCTCGAC CTGGCGGAGG AGGCGTTCCA GGAAGCCTGC
ATCCGGGCAC TGAAAAACTG GCCGCGTACC GGTCCTCCGC GCGATCCCGC CGCCTGGCTG
ATCTTCGTCG GCCGCAACAG CGGCATCGAC CGGGTGCGCA GGCAGTCGCG CGAGACGGCA
TTGCCGCCGG AGGAACTGCT CTCCGACCTT GGCGACAGGG AGAGCGAGCT CGCCGATCGC
CTCGATGGCG CACACTATCG CGACGATATC CTGCGGCTGC TTTTCGTCTG CAGCAATCCG
GCGCTTCCGG CGACCCAGCA GATCGCGCTT GCACTGCGCA TCGTATCGGG CCTTTCCGTC
AGGCAGATAG CCCGTGCCTT TCTTGTCGGC GAGGCGGCGA TGGAGCAGCG CATCACCCGC
GCCAAGGCGC GTGTTGCGGC CGCCGGCATC CCTTTCGAAA CGCCCGATGC CGCCGACCGG
GCGGAGCGCC TCGCTGCAGT CGCGACGATG ATCTATCTTG TCTTCAACGA GGGCTACTCG
GCGATGAACG GCCCCGAGGG TGTCTCCGCA GATCTCTGCG ACGAGGCGAT CCGCCTGTCC
CGGCTGCTTC TTCGGCTGTT TCCGGCCGAG CCGGAGATCA TGGGGCTGGC GGCGCTTCTG
CTCCTCCAGC ATTCGCGCGC CCGCGCTCGG TTCGATGCCG CTGGCGCCGT GGTCCTGCTC
GAAGATCAGG ATCGGCAGCT CTGGAACCGG CCGATGATCA CCGAGGCGCT GGCGATGATC
GACAAAGCGA TGCGCCACCG GCGTCCCGGT CCCTATCAGA TCCAGGCCGC AATCGCTGCC
CTGCACGCCC GCGCCTCACG GCCGGAGGAA ACGGATTGGG AGGAAATAGA CCTCCTTTAC
CAGGCGCTCG AACGCCTGCA GCCCTCACCC GTCGTGACCC TCAATCGCGC CGTCGCGGTT
TCGAAACGCG AGGGGCCCGA GGCCGCGCTG GCGATGATTG AGCCTTTGGG CGAGCGGCTG
TCCGGCTACT TCTATTATCA TGGGCTGCGC GGCGGCCTCC TGAAGCGGCT TGGCCTTGCA
TGCGAGGCGC GCAAAGCTTT CAACCAGGCA ATCGCACTCG CCACCAACGC GGCCGAAGCG
GCCTATATCC GGACCCAGCT CGATCACCTC GCGGCTGCGC CGATGCCGGA ACCTTCCTTC
TCAAGTGATT GTCCTGGCGC GTAG
 
Protein sequence
MTDTAWIDLA LVSARPQAMG ALLRYFRSLD LAEEAFQEAC IRALKNWPRT GPPRDPAAWL 
IFVGRNSGID RVRRQSRETA LPPEELLSDL GDRESELADR LDGAHYRDDI LRLLFVCSNP
ALPATQQIAL ALRIVSGLSV RQIARAFLVG EAAMEQRITR AKARVAAAGI PFETPDAADR
AERLAAVATM IYLVFNEGYS AMNGPEGVSA DLCDEAIRLS RLLLRLFPAE PEIMGLAALL
LLQHSRARAR FDAAGAVVLL EDQDRQLWNR PMITEALAMI DKAMRHRRPG PYQIQAAIAA
LHARASRPEE TDWEEIDLLY QALERLQPSP VVTLNRAVAV SKREGPEAAL AMIEPLGERL
SGYFYYHGLR GGLLKRLGLA CEARKAFNQA IALATNAAEA AYIRTQLDHL AAAPMPEPSF
SSDCPGA