Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3529 |
Symbol | |
ID | 6131776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 3937509 |
End bp | 3939335 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641643698 |
Product | transcriptional regulator NifA |
Protein accession | YP_001770346 |
Protein GI | 170741691 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0603248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0582265 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGAGC GCGTCGTCCC GGCGCGGCTC GCGCAAAGCC AGCCGCAACC GCCGCGGCCC GCGCCGCAGG CGCCGAGAGC GCGGGCCGCC CCGCCGAGCC CGCCCCCATC CCCCGCGCCG AGCCCCGCCG CGAGCCCGTC CCCGGCGCCC CGCCCGGCCG CGCGGGGCAC CTCGCCCGAG GTGGCGCTGA TCGGCATCTA CGAGATCTCC AAGGTGCTGA CCGCGGCGCG GCGCCTCGAA GTCACCCTCG CCAACGTGGT CAACATCCTG TCCTCGATGC TGAGCATGCA TCACGGCATG ATCGTCATCC TCGACCGGGA GGGCGGGCCG GACATCGTCT CCAGCACCGG CTGGACGCCG CAGGTCGCCC ACCACATCCG GGCGCGGGTG CCCCAGAAGG CGATCGACCA GATCGTCGCC ACCGCGACCC CGCTGGTGGT GCAGGACGTC GCCGCCGACC CGCTCTTCGC GGGCCATCTC GACCTGTTCG AGGATGCCGG CAAGGCCACG GTCTCGTTCA TCGGCGTGCC GATCAAGGCG GATTCGCGCG TCCTCGGCAC GCTCTCGATC GACCGGATCT GGGACGGCAG CGCGCGCTTC CGCTTCGACG AGGACACGCG CTTCCTCACC ATGGTGGCCA ACCTCGTCGG CCAAGCGGTC CGGCTGCACA ACACGGTGGC GCAGGACCGC GACCAGCTCA TCGCCAAGAC CCACCGGCTG GAGAAGGCGC TCGCCGAATC GAGCGCGACC TCCTCGTCGG TCGGCATCAT CGGCGAGAGC CAGAAGGTGA AGCGCCTCGT CGCGATGACG GAGGTCGCGG CGCGCTCCAA CACCACGGTC CTGCTGCGCG GCGAGAGCGG CACCGGCAAG GAACTCTTCG CGCGCGCCAT CCACGACCTC TCGCCCCGCA AGGGCAAGCC CTTCGTGCGG GTGAACTGCG CGGCGCTCGC CGAGAGCGTG CTCGAATCCG AGCTGTTCGG CCACGAGAAG GGGGCCTTCA CGGGCGCGGT CGGGACGCGC CAGGGCCGGT TCGAACTCGC CCATGGCGGG ACGCTCTTCC TCGACGAGAT CGGCGAGGTG AGCGCCACCT TCCAGGCCAA GCTGCTGCGC GTCCTGCAGG AGGGGGAGTT CGAGCGCGTG GGCGGCAACC GCACCCTGCG CGTCGACGTG CGGCTGGTCT TCGCGACCAA CCGCAACCTG GAGGAGGCGG TCGCCAAGGG CGATTTCCGC GCCGACCTCT ACTATCGCAT CAACGTGGTG TCGCTGCTGC TGCCGCCCCT GCGCGAGCGC CAGGGCGACA TCCCGGACCT CGCGAGGGCC TTCCTCTCCC GCTACAATGC CGAGAACAAG TCGAAGCTCG CCTTCGCGCA GACCGCGCTC GACGTGCTGC AGAAGTGCTA CTTTCCCGGC AACGTGCGCG AACTTGAGAA TTGCGTCCGG CGCACGGCCA CGTTGGCGGC CGGCGAGGTG ATCACCGCGC GGGACTTCGC CTGCGTGAGC GGGCAGTGCC TCTCGGCGGT GCTCTGGAAG GGCAGCACGC AGAAGCCCGC GGGCGCGGCC CTGGTCGCCG CTCCGGCGGC CGCGATCCCG GTGCCGATGA GCCCGGACGC GTCGCCCCCG GCGGCCGAGG GCGAGCCGCC CGCCGCCTGC CCGGGCGCGG AGACCTGCCC GGTCGTGCGG CCGCGCCCGA CCGAGCGGGA GCAGCTGCTG CAGGCGATGG AGCGCGCCGG CTGGGTGCAG GCCAAGGCCG CCCGCCTCCT CAAGATCACG CCCCGCCAGA TCGGCTACGC CCTGCGCAAG CACGGCATCG ACATCAAGCG GATCTGA
|
Protein sequence | MVERVVPARL AQSQPQPPRP APQAPRARAA PPSPPPSPAP SPAASPSPAP RPAARGTSPE VALIGIYEIS KVLTAARRLE VTLANVVNIL SSMLSMHHGM IVILDREGGP DIVSSTGWTP QVAHHIRARV PQKAIDQIVA TATPLVVQDV AADPLFAGHL DLFEDAGKAT VSFIGVPIKA DSRVLGTLSI DRIWDGSARF RFDEDTRFLT MVANLVGQAV RLHNTVAQDR DQLIAKTHRL EKALAESSAT SSSVGIIGES QKVKRLVAMT EVAARSNTTV LLRGESGTGK ELFARAIHDL SPRKGKPFVR VNCAALAESV LESELFGHEK GAFTGAVGTR QGRFELAHGG TLFLDEIGEV SATFQAKLLR VLQEGEFERV GGNRTLRVDV RLVFATNRNL EEAVAKGDFR ADLYYRINVV SLLLPPLRER QGDIPDLARA FLSRYNAENK SKLAFAQTAL DVLQKCYFPG NVRELENCVR RTATLAAGEV ITARDFACVS GQCLSAVLWK GSTQKPAGAA LVAAPAAAIP VPMSPDASPP AAEGEPPAAC PGAETCPVVR PRPTEREQLL QAMERAGWVQ AKAARLLKIT PRQIGYALRK HGIDIKRI
|
| |