Gene Mext_2699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2699 
Symbol 
ID5830946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3018753 
End bp3020390 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content68% 
IMG OID641368499 
Producttranscription termination factor NusA 
Protein accessionYP_001640161 
Protein GI163852118 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.311383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.577361 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTCG TCAGCGCCAA TCGGCTCGAA CTCCTGCAGA TCGCCGAGGC GGTCGCCCGC 
GAGAAGGTGA TCGACCGCCA GATCGTCATC GAGGCGATGG AAGAGGCGAT CGCGAAGGCG
GCCCGCTCGC GCTACGGCGC CGAGACCGAC GTTCACGCCG AGATCGACAC GAAGAGCGGC
GCGCTGCGCC TGTCCCGCCA CCTCCTCGTG GTCGATCAGG TTGAGAACGA CGCCCGTGAG
ATCACCCTCG ATCAGGCCCG CCGCTACAAT CCCGGCGCCC TCATCGGCGA CGTGATCTCC
GATACCCTGC CGCCGTTCGA TTTCGGCCGC GTCGCGGCGC AATCGGCCAA GCAGGTCATC
GTCCAGAAGG TGCGCGACGC CGAGCGCGCC CGCCAGTACG ACGAGTACAA GGACCGGATC
GGCGAGATCC TCAACGGCGT GGTCAAGCGC GTCGAGTACG GCAACGTCAT CGTCGATCTC
GGCCGCGGCG AGGGCATCGT CCGCCGCGAC GAGATGATCC CGCGCGAGAC CTTCCGCCCC
GGCGACCGTA TCCGCGCCTA CCTGTTCGAC GTGCGCTCCG AGGTGCGCGG GCCGCAGATC
TTCCTGTCGC GCTCGCACCC GCAATTCATG GCCAAGCTGT TCGGCCAGGA AGTGCCGGAG
ATCTATGACG GTATCGTCGA GGTGAAAGCG GTCGCCCGCG ATCCCGGCTC GCGCGCCAAG
ATCGCGGTCA TCTCCCGCGA CTCCTCGATC GACCCGGTCG GCGCCTGCGT CGGTATGCGC
GGATCCCGCG TCCAGGCGGT GGTCGGCGAG CTTCAGGGCG AGAAGATCGA CATCATTCCG
TGGTCGGAAG ATCAGGCAAC CTTCATCGTC AACGCGCTGC AGCCGGCCGA GGTCGTGAAG
GTGGTGCTCG ACGAGGAAGC CGACCGCATC GAGGTGGTGG TGCCCGACGA CCAGCTCTCG
CTGGCCATCG GCCGCCGCGG CCAGAACGTG CGGCTGGCCT CGCAGCTCAC CGGCTGGGAC
ATCGACATCC TGACCGAGGC CGAGGAATCC GAGCGGCGCC AGAAGGAGTT CGCGGAGCGG
ACTCAGGCGT TCATGGAAGC GCTCGACGTG GACGAGACGG TTGGCCAGTT GCTGGCCGCC
GAAGGCTTCC GCAACGTCGA GGAAATCGCC TTCGTCGATG TCGCCGAACT CTCCAACATC
CAGGGCCTCG ACGAGGAGAC CGGTGCCGAG ATCCAGGCCC GCGCCCAGGA TTACCTCGCC
CGGATCGAGC AGGAGCAGGA CGACCGCCGC CGCGAACTCG GCGTCGAGGA CGAACTGCGC
GAGATCGACG GCATCACCAC CGCGATGATG GTGGCGCTGG GCGAGAACGA GGTGAAGACC
GTCGAAGATC TCGCCGGCTG CGCCACCGAC GACCTCGTCG GCTACACCGA AGGCCGCGGC
CCCGAGGCCG TGCGCCATGC CGGCTATCTC GACGGCTTCG AGCTGTCGCG GGCCGAGGCC
GAGGCGCTGA TCATGGCCGC CCGTCTGAAG GCCGGCTGGA TCGACGCGCT GCCGGAGCCG
GAGGGTGAAG CCGCCGAGGG CGACGCCCAG GACGGCGATG CGATCGAGGA AGCGACGGCC
GAGCCGCAGC AGGCTTGA
 
Protein sequence
MAVVSANRLE LLQIAEAVAR EKVIDRQIVI EAMEEAIAKA ARSRYGAETD VHAEIDTKSG 
ALRLSRHLLV VDQVENDARE ITLDQARRYN PGALIGDVIS DTLPPFDFGR VAAQSAKQVI
VQKVRDAERA RQYDEYKDRI GEILNGVVKR VEYGNVIVDL GRGEGIVRRD EMIPRETFRP
GDRIRAYLFD VRSEVRGPQI FLSRSHPQFM AKLFGQEVPE IYDGIVEVKA VARDPGSRAK
IAVISRDSSI DPVGACVGMR GSRVQAVVGE LQGEKIDIIP WSEDQATFIV NALQPAEVVK
VVLDEEADRI EVVVPDDQLS LAIGRRGQNV RLASQLTGWD IDILTEAEES ERRQKEFAER
TQAFMEALDV DETVGQLLAA EGFRNVEEIA FVDVAELSNI QGLDEETGAE IQARAQDYLA
RIEQEQDDRR RELGVEDELR EIDGITTAMM VALGENEVKT VEDLAGCATD DLVGYTEGRG
PEAVRHAGYL DGFELSRAEA EALIMAARLK AGWIDALPEP EGEAAEGDAQ DGDAIEEATA
EPQQA