Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_6232 |
Symbol | |
ID | 5320534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 1153011 |
End bp | 1154636 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640777836 |
Product | transcriptional regulator NifA |
Protein accession | YP_001314768 |
Protein GI | 150378173 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAAC AGGACAAGCG GTCCGCCGAA ATTTACAGCA TATCAAAGGC TCTGATTGCC CCCACTCGTC TTGACATCAC GCTGAACAAT TTCGTGAATA CCCTCTCTTT GGTTCTGCGC ATGCGCCGCG GCGGACTCGA GATTCCGGCG TCGGAAGGAG AGGCAAAGAT CACAGCGGCT ACCCGCAGCA GTGGGTCTCC TTCTGCCGCT GACTATACTG TACCAAAGGC CGCAATAGAC CAAATCATCA CTGCCGGGCG GCTGGTCGTA CCAGACGTTT GCAACTCGGA GCTGTTCAAG GATCAGATAA AATGGCACGG AATTGGTCCG ACGGCCTTCA TCGCTGCGGC GGTGGAGGTC GATCACGAAA CGGGCGGAAT GCTGTGGTTC GAGCGCGCCA AAGAGTCCGG TTATGACTAT GAAGAGGAGG TCCACTTGCT TTCCATGGCC GCCGATCTTG CGGGCAGGGC CATTCGGCTT CATCGCACTA TCAGCAGGCG TGGGCGGACA TTTGCCGAAG AGCAGCAAGA ACAGCAGAAT TCACATGATG AGCAGAGCCA GGGTTTCGCC CGCCAGCGGC TGCTCAAGAA TGACGGGATC ATCGGGGAAA GTCCCGCCCT CATGGCGGCG GTAGAAACCG CCAAAGTCGT GGCAGAGACC AATTCAACCG TTCTCCTCAG GGGCGAAAGC GGAACTGGCA AGGAATGCTT TGCGAAGCTA ATTCACCAGC ATTCGACTCG GCAAAAAAAG CCGTTCATTA AGTTCAATTG CCCCGCGCTG TCTGAGAGCC TTCTCGAATC AGAGCTGTTT GGACATGAGA AAGGTGCCTT CACCGGGGCT ATTGCTCAAC GAGTAGGCCG TTTCGAATCG GCGAATGGCG GAACGTTGCT GCTCGATGAA ATCGGCGAGA TCCCGCCGGC ATTCCAAGCA AAACTGCTAC GCGTAATACA GGAAGGTGAA TTTGAACGAG TCGGCGGCAC AAAGACGCTG AAAGTCGACA TCCGGCTCAT ATTCGCCACA AATAAGGATC TCGAAATGGC GGTCCGGAAT GGGGAGTTCA GGGAAGACCT TTACTACCGC ATCAGTGTGG TGCCCATAAC TTTGCCGCCG CTTAGGAAAC GCGACGGTGA CATTCCGCTC CTTGCAAAAG CTTTCCTTCA GCGGTTCAAT GAAGAGAACG GTCGTGGTCT CCATTTCGTG CCGTCTGCGC TTGACCACTT GTCGAGGTGC AAGTTCCCCG GAAACGTTCG CGAACTGGAA AACTGTGTGC GGAGGACTGC AACTCTCGCC AGGGCAAAGA CGATCACTTC GTCAGATTTC GCCTGCCAGA CGGACCAGTG CTTTTCTTCT CGCCTCTGGA AAGGCATTCG CTGTTCGCAT TGCCACATTG CCACCGATGC GTCCGCGGGT ACGACACCGT TGGTAGGAGC GCCAGCCAAT GATGTTCCGC CGAAGGATCC CGCATCCGCA GGGGCGGCAT CCAATCTGAT CGAGCGCGAC CGGTTGATCA GTGCGCTGGA GGAGGCCGGT TGGAATCAGG CAAAGGCAGC TCGCATCCTC GAAAAAACGC CCCGGCAGGT CGGGTATGCT ATACGTCGGC ATGGTGTAGA GGTGAGAAAG CTCTAA
|
Protein sequence | MRKQDKRSAE IYSISKALIA PTRLDITLNN FVNTLSLVLR MRRGGLEIPA SEGEAKITAA TRSSGSPSAA DYTVPKAAID QIITAGRLVV PDVCNSELFK DQIKWHGIGP TAFIAAAVEV DHETGGMLWF ERAKESGYDY EEEVHLLSMA ADLAGRAIRL HRTISRRGRT FAEEQQEQQN SHDEQSQGFA RQRLLKNDGI IGESPALMAA VETAKVVAET NSTVLLRGES GTGKECFAKL IHQHSTRQKK PFIKFNCPAL SESLLESELF GHEKGAFTGA IAQRVGRFES ANGGTLLLDE IGEIPPAFQA KLLRVIQEGE FERVGGTKTL KVDIRLIFAT NKDLEMAVRN GEFREDLYYR ISVVPITLPP LRKRDGDIPL LAKAFLQRFN EENGRGLHFV PSALDHLSRC KFPGNVRELE NCVRRTATLA RAKTITSSDF ACQTDQCFSS RLWKGIRCSH CHIATDASAG TTPLVGAPAN DVPPKDPASA GAASNLIERD RLISALEEAG WNQAKAARIL EKTPRQVGYA IRRHGVEVRK L
|
| |