Gene Msil_2297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2297 
Symbol 
ID7090281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2489201 
End bp2490799 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content64% 
IMG OID643465620 
ProductNusA antitermination factor 
Protein accessionYP_002362590 
Protein GI217978443 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0358902 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTCA GCGCCAACAG GCTCGAACTT TTGCAGATCG CCGACGCGGT GGCGCGGGAA 
AAATCGATCG ACCGTCAAAT CGTTCTCTCC TCGATGGAGG ATGCGATCCA GAAGGCGGCG
CGCTCGCGCT ACGGTCAGGA GACCGAGGTT CGCGCCGAGA TCAATCCGAA GACCGGCGAA
ATCCGCTTCT CGCGCCTGCT GCTGGTGGTC GATCAGATTG AAAATGACGC CATCCACATC
ACGCTTGAGG ACGCCCGCAA GAAGAACCCG GCGGCGCAGG TCGGCGACTG GATCGCCGAG
ACCCTGCCGC CGTTTGACTT TGGCCGCATC GCCGCCCAGT CGGCGAAGCA GGTCATCGTG
CAGAAGGTGC GCGAGGCCGA GCGCGACCGT CAGTATCAGG AATATAAGGA TCGCATCGGC
GACATCGTCA ACGGCGTCGT CAAGCGCGTC GAATATGGCA ATGTGATCAT CGATCTCGGG
CGCGGCGAGG CGACGATCCG CCGCGACGAA ATGATCCCGC GCGAGATGTT CCGGCCGGGC
GACCGCGCCA GGGCCTATGT CTATGACGTG CGCCGCGAAC AGCGCGGGCC GCAGATTTTC
CTCTCGCGCA CGCATCCGCA GTTCATGGCC AAGCTGTTCC AGCAGGAAGT GCCGGAAATC
TACGACAATA TCATCCAGGT GAAGGCGGTC GCCCGCGACC CGGGCTCCCG CGCCAAAATC
GCGGTGATTT CGCGCGACGC CTCGATCGAT CCGGTCGGCG CCTGCGTCGG CATGCGCGGC
TCGCGCGTGC AGGCCGTCGT GAATGAATTG CAGGGCGAGA AGATCGACAT CATCCCCTGG
TCGCCGGACG CCGCGACCTT CATCGTCAAT GCGCTGCAGC CGGCCGAAGT GGTCAAGGTC
GTGCTTGACG AAGACTCGGC GCGTATTGAA GTTGTGGTGC CAGATGACCA ATTGTCATTG
GCGATCGGAC GCCGCGGCCA GAACGTCCGC CTCGCCTCGC AGCTGACCGG CTGGGACATC
GACATCCTGA CCGAGGCCGA GGAATCGGAG CGCCGGCAGA AGGAGTTCGT CGAGCGCACC
AACGCCTTCA TGAACGCGCT CGACGTCGAT GAGGTGGTCG GCCAATTGCT CGCCTCGGAA
GGTTTCCGGT CGGTGGAGGA GCTCGCTTTC GTCGAGCCGG CCGAGCTGGC CGCGATCGAA
GGCTTTGACG AAGACACCGC CGTCGAGATC CAGGCGCGGG CGCAGGACTA TCTGGCGCGC
ATCGAGGCCG AGCAGGACCA GCGCCGCATC GAACTTGGCG TCGCCGACGA GCTCCGGGAG
GTCGCCGGCG TCACCACGGC GATGCTGGTC AAATTCGGCG AGAACGACGT CAAGACGGTC
GAGGACCTCG CCGGCTGCGC GACCGACGAT CTGATCGGCT GGACCGAGCG CAAGGAAGGC
GAAAGCGTCA AGCACGCCGG CTATCTCGAT GGCTTCGAGC TGACCCGCGA AGAGGCCGAG
ACGATGATCA TGACCGCCCG CGTTCATGCC GGCTGGATTG ACGCCATCCC GCAGCCCGCG
GTTGAAGAGC CGCAGCTCGA GGGAGAGGTT CGGGACTGA
 
Protein sequence
MAVSANRLEL LQIADAVARE KSIDRQIVLS SMEDAIQKAA RSRYGQETEV RAEINPKTGE 
IRFSRLLLVV DQIENDAIHI TLEDARKKNP AAQVGDWIAE TLPPFDFGRI AAQSAKQVIV
QKVREAERDR QYQEYKDRIG DIVNGVVKRV EYGNVIIDLG RGEATIRRDE MIPREMFRPG
DRARAYVYDV RREQRGPQIF LSRTHPQFMA KLFQQEVPEI YDNIIQVKAV ARDPGSRAKI
AVISRDASID PVGACVGMRG SRVQAVVNEL QGEKIDIIPW SPDAATFIVN ALQPAEVVKV
VLDEDSARIE VVVPDDQLSL AIGRRGQNVR LASQLTGWDI DILTEAEESE RRQKEFVERT
NAFMNALDVD EVVGQLLASE GFRSVEELAF VEPAELAAIE GFDEDTAVEI QARAQDYLAR
IEAEQDQRRI ELGVADELRE VAGVTTAMLV KFGENDVKTV EDLAGCATDD LIGWTERKEG
ESVKHAGYLD GFELTREEAE TMIMTARVHA GWIDAIPQPA VEEPQLEGEV RD