Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2297 |
Symbol | |
ID | 7090281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 2489201 |
End bp | 2490799 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643465620 |
Product | NusA antitermination factor |
Protein accession | YP_002362590 |
Protein GI | 217978443 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.0358902 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTCA GCGCCAACAG GCTCGAACTT TTGCAGATCG CCGACGCGGT GGCGCGGGAA AAATCGATCG ACCGTCAAAT CGTTCTCTCC TCGATGGAGG ATGCGATCCA GAAGGCGGCG CGCTCGCGCT ACGGTCAGGA GACCGAGGTT CGCGCCGAGA TCAATCCGAA GACCGGCGAA ATCCGCTTCT CGCGCCTGCT GCTGGTGGTC GATCAGATTG AAAATGACGC CATCCACATC ACGCTTGAGG ACGCCCGCAA GAAGAACCCG GCGGCGCAGG TCGGCGACTG GATCGCCGAG ACCCTGCCGC CGTTTGACTT TGGCCGCATC GCCGCCCAGT CGGCGAAGCA GGTCATCGTG CAGAAGGTGC GCGAGGCCGA GCGCGACCGT CAGTATCAGG AATATAAGGA TCGCATCGGC GACATCGTCA ACGGCGTCGT CAAGCGCGTC GAATATGGCA ATGTGATCAT CGATCTCGGG CGCGGCGAGG CGACGATCCG CCGCGACGAA ATGATCCCGC GCGAGATGTT CCGGCCGGGC GACCGCGCCA GGGCCTATGT CTATGACGTG CGCCGCGAAC AGCGCGGGCC GCAGATTTTC CTCTCGCGCA CGCATCCGCA GTTCATGGCC AAGCTGTTCC AGCAGGAAGT GCCGGAAATC TACGACAATA TCATCCAGGT GAAGGCGGTC GCCCGCGACC CGGGCTCCCG CGCCAAAATC GCGGTGATTT CGCGCGACGC CTCGATCGAT CCGGTCGGCG CCTGCGTCGG CATGCGCGGC TCGCGCGTGC AGGCCGTCGT GAATGAATTG CAGGGCGAGA AGATCGACAT CATCCCCTGG TCGCCGGACG CCGCGACCTT CATCGTCAAT GCGCTGCAGC CGGCCGAAGT GGTCAAGGTC GTGCTTGACG AAGACTCGGC GCGTATTGAA GTTGTGGTGC CAGATGACCA ATTGTCATTG GCGATCGGAC GCCGCGGCCA GAACGTCCGC CTCGCCTCGC AGCTGACCGG CTGGGACATC GACATCCTGA CCGAGGCCGA GGAATCGGAG CGCCGGCAGA AGGAGTTCGT CGAGCGCACC AACGCCTTCA TGAACGCGCT CGACGTCGAT GAGGTGGTCG GCCAATTGCT CGCCTCGGAA GGTTTCCGGT CGGTGGAGGA GCTCGCTTTC GTCGAGCCGG CCGAGCTGGC CGCGATCGAA GGCTTTGACG AAGACACCGC CGTCGAGATC CAGGCGCGGG CGCAGGACTA TCTGGCGCGC ATCGAGGCCG AGCAGGACCA GCGCCGCATC GAACTTGGCG TCGCCGACGA GCTCCGGGAG GTCGCCGGCG TCACCACGGC GATGCTGGTC AAATTCGGCG AGAACGACGT CAAGACGGTC GAGGACCTCG CCGGCTGCGC GACCGACGAT CTGATCGGCT GGACCGAGCG CAAGGAAGGC GAAAGCGTCA AGCACGCCGG CTATCTCGAT GGCTTCGAGC TGACCCGCGA AGAGGCCGAG ACGATGATCA TGACCGCCCG CGTTCATGCC GGCTGGATTG ACGCCATCCC GCAGCCCGCG GTTGAAGAGC CGCAGCTCGA GGGAGAGGTT CGGGACTGA
|
Protein sequence | MAVSANRLEL LQIADAVARE KSIDRQIVLS SMEDAIQKAA RSRYGQETEV RAEINPKTGE IRFSRLLLVV DQIENDAIHI TLEDARKKNP AAQVGDWIAE TLPPFDFGRI AAQSAKQVIV QKVREAERDR QYQEYKDRIG DIVNGVVKRV EYGNVIIDLG RGEATIRRDE MIPREMFRPG DRARAYVYDV RREQRGPQIF LSRTHPQFMA KLFQQEVPEI YDNIIQVKAV ARDPGSRAKI AVISRDASID PVGACVGMRG SRVQAVVNEL QGEKIDIIPW SPDAATFIVN ALQPAEVVKV VLDEDSARIE VVVPDDQLSL AIGRRGQNVR LASQLTGWDI DILTEAEESE RRQKEFVERT NAFMNALDVD EVVGQLLASE GFRSVEELAF VEPAELAAIE GFDEDTAVEI QARAQDYLAR IEAEQDQRRI ELGVADELRE VAGVTTAMLV KFGENDVKTV EDLAGCATDD LIGWTERKEG ESVKHAGYLD GFELTREEAE TMIMTARVHA GWIDAIPQPA VEEPQLEGEV RD
|
| |