Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0004 |
Symbol | |
ID | 5320831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 3456 |
End bp | 4739 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640788935 |
Product | RNA polymerase ECF-subfamily sigma factor |
Protein accession | YP_001325699 |
Protein GI | 150395232 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.122751 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000113113 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGACA CAGCCTGGAT CGATCTCGCA CTGGTTTCCG CACGGCCACA GGCCATGGGT GCGTTGCTGC GATACTTTCG CAGCCTCGAC CTGGCGGAGG AGGCGTTCCA GGAAGCCTGC ATCCGGGCAC TGAAAAACTG GCCGCGTACC GGTCCTCCGC GCGATCCCGC CGCCTGGCTG ATCTTCGTCG GCCGCAACAG CGGCATCGAC CGGGTGCGCA GGCAGTCGCG CGAGACGGCA TTGCCGCCGG AGGAACTGCT CTCCGACCTT GGCGACAGGG AGAGCGAGCT CGCCGATCGC CTCGATGGCG CACACTATCG CGACGATATC CTGCGGCTGC TTTTCGTCTG CAGCAATCCG GCGCTTCCGG CGACCCAGCA GATCGCGCTT GCACTGCGCA TCGTATCGGG CCTTTCCGTC AGGCAGATAG CCCGTGCCTT TCTTGTCGGC GAGGCGGCGA TGGAGCAGCG CATCACCCGC GCCAAGGCGC GTGTTGCGGC CGCCGGCATC CCTTTCGAAA CGCCCGATGC CGCCGACCGG GCGGAGCGCC TCGCTGCAGT CGCGACGATG ATCTATCTTG TCTTCAACGA GGGCTACTCG GCGATGAACG GCCCCGAGGG TGTCTCCGCA GATCTCTGCG ACGAGGCGAT CCGCCTGTCC CGGCTGCTTC TTCGGCTGTT TCCGGCCGAG CCGGAGATCA TGGGGCTGGC GGCGCTTCTG CTCCTCCAGC ATTCGCGCGC CCGCGCTCGG TTCGATGCCG CTGGCGCCGT GGTCCTGCTC GAAGATCAGG ATCGGCAGCT CTGGAACCGG CCGATGATCA CCGAGGCGCT GGCGATGATC GACAAAGCGA TGCGCCACCG GCGTCCCGGT CCCTATCAGA TCCAGGCCGC AATCGCTGCC CTGCACGCCC GCGCCTCACG GCCGGAGGAA ACGGATTGGG AGGAAATAGA CCTCCTTTAC CAGGCGCTCG AACGCCTGCA GCCCTCACCC GTCGTGACCC TCAATCGCGC CGTCGCGGTT TCGAAACGCG AGGGGCCCGA GGCCGCGCTG GCGATGATTG AGCCTTTGGG CGAGCGGCTG TCCGGCTACT TCTATTATCA TGGGCTGCGC GGCGGCCTCC TGAAGCGGCT TGGCCTTGCA TGCGAGGCGC GCAAAGCTTT CAACCAGGCA ATCGCACTCG CCACCAACGC GGCCGAAGCG GCCTATATCC GGACCCAGCT CGATCACCTC GCGGCTGCGC CGATGCCGGA ACCTTCCTTC TCAAGTGATT GTCCTGGCGC GTAG
|
Protein sequence | MTDTAWIDLA LVSARPQAMG ALLRYFRSLD LAEEAFQEAC IRALKNWPRT GPPRDPAAWL IFVGRNSGID RVRRQSRETA LPPEELLSDL GDRESELADR LDGAHYRDDI LRLLFVCSNP ALPATQQIAL ALRIVSGLSV RQIARAFLVG EAAMEQRITR AKARVAAAGI PFETPDAADR AERLAAVATM IYLVFNEGYS AMNGPEGVSA DLCDEAIRLS RLLLRLFPAE PEIMGLAALL LLQHSRARAR FDAAGAVVLL EDQDRQLWNR PMITEALAMI DKAMRHRRPG PYQIQAAIAA LHARASRPEE TDWEEIDLLY QALERLQPSP VVTLNRAVAV SKREGPEAAL AMIEPLGERL SGYFYYHGLR GGLLKRLGLA CEARKAFNQA IALATNAAEA AYIRTQLDHL AAAPMPEPSF SSDCPGA
|
| |