Gene Information |
Locus tag | Mpe_A1922 |
Symbol | |
ID | 4786683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2061732 |
End bp | 2063210 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640090492 |
Product | NusA antitermination factor |
Protein accession | YP_001021115 |
Protein GI | 124267111 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA; [TIGR01954] transcription termination factor NusA, C-terminal duplication |
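The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2061732
END_BP = 2063210
REPORTED_GENE_LENGTH = 1479     # bp
REPORTED_PROTEIN_LENGTH = 492   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of residues for an unspliced CDS whose length includes
    the stop codon (an assumption, not stated in the record)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == REPORTED_GENE_LENGTH)
    print(protein_length(REPORTED_GENE_LENGTH) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both checks agree with the record: 2063210 - 2061732 + 1 = 1479 bp, and 1479 / 3 - 1 = 492 aa.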
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0208405 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGCG ATCTGTTGGA CTTTGTGGAT GCGATCGCTC GCGAGAAGAG CGTCGAGCGT GATGTCGTGT TCGAGGCCGT CGAGGCGGCG CTCGCCTCGG CGAGCAAGAA ACTGCATGGC GGCGAGGTCG ACATCCGCGT ATCGGTCGAC CGCGACACGG GCGAGTACGA AACGTTTCGC CGCTGGCTCG TCGTGCCCGA CAGCGCCGGC CTGCAGAACG CCGACGCCGA GGAACTGCTG ACCGATGCCC GCGACCGCAT CGAAGACATC GAGGAAGGCG ACTACATCGA GGAAGCCATC GAGTCGGTGT CGATCGGGCG CATCGGTGCG CAGGCCGCCA AGCAGGTGAT CCTGCAGAAG GTGCGCGACG CCGAGCGCGA GCAGTTGCTC AATGACTTCC TGTCGCGCGG CGACAAGATC TTCGTCGGCA CCGTCAAGCG CCTCGACAAG GGCGACCTCG TCGTCGAGAG CGGCCGGGTC GAGGGGCGTC TGAAGCGAAG TGAGCTGATT GCCAAGGAAA ACCTCCGCAC CGGCGACCGC GTTCGCGCCT ACATCACGGA AGTGGACACC ACGCAACGCG GGCCGCAGAT CATGCTGTCG CGCAGCGCAC CTGGCTTCAT GGTGGAACTG TTCCGCCACG AGGTTCCGGA GATCGAGCAG GGCCTGCTCG AGATCAAAAG CTGCGCCCGA GATGCAGGTT CGCGCGCCAA GATCGCCGTG CTGTCGCACG ACAAGCGGGT CGACCCGATC GGCACCTGCG TGGGTGTGCG CGGCTCGCGC GTCAATGCCG TCACCAACGA ACTGGCTGGC GAGCGTGTCG ACATCGTGCT GTGGTCGGCC GATCCGGCGC AGTTCGTGAT CGGCGCGTTG GCGCCGGCCA ATGTGCAGTC GATCGTGGTC GACGAGGAAA AGCATGCGAT GGACGTGGTG GTCGACGAGG AGAACCTCGC CATCGCCATC GGCCGCGGCG GCCAGAATGT GCGCCTCGCT TCTGAGCTGA CCGGCTGGCG CATCAACATC ATGAGCGCCG AGGAGTCGCA GGACAAGCAG GCGACCGAAT CGGAGTCGAT CCGCAAGCTG TTTGTCGAGA AGCTCGACGT CGATGCCGAG GTGGCCGACA TCCTGATCGC CGAGGGCTTC ACCAGCCTCG AGGAAGTGGC CTACGTGCCG CTGCAGGAAA TGCTGGAGAT GGAGTCCTTC GATGAGGACA CCGTCCACGA GTTGCGCACG CGGGCCAAGG ACGCACTGCT GACGATGGAG ATCGCCCAGG AAGAGAAGCT CGAAAGCGTT TCGCAGGATC TGCGCGACCT CGAGGGGCTC GACGCCGAGC TGATCGCCAG GCTGGCCGAA GGCGGCATCC ACACGCGCGA CGACCTGGCC GATCTCGCGG TCGACGAACT GACCGAGCTG ACCGGCGTGG CCGACGAGCA GGCGAAGGCC CTGATCATGA AGGCGCGCGA GCACTGGTTC ACTGCCTGA
|
Protein sequence | MNRDLLDFVD AIAREKSVER DVVFEAVEAA LASASKKLHG GEVDIRVSVD RDTGEYETFR RWLVVPDSAG LQNADAEELL TDARDRIEDI EEGDYIEEAI ESVSIGRIGA QAAKQVILQK VRDAEREQLL NDFLSRGDKI FVGTVKRLDK GDLVVESGRV EGRLKRSELI AKENLRTGDR VRAYITEVDT TQRGPQIMLS RSAPGFMVEL FRHEVPEIEQ GLLEIKSCAR DAGSRAKIAV LSHDKRVDPI GTCVGVRGSR VNAVTNELAG ERVDIVLWSA DPAQFVIGAL APANVQSIVV DEEKHAMDVV VDEENLAIAI GRGGQNVRLA SELTGWRINI MSAEESQDKQ ATESESIRKL FVEKLDVDAE VADILIAEGF TSLEEVAYVP LQEMLEMESF DEDTVHELRT RAKDALLTME IAQEEKLESV SQDLRDLEGL DAELIARLAE GGIHTRDDLA DLAVDELTEL TGVADEQAKA LIMKAREHWF TA
|
| |
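The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is an illustrative example, not the database's own method. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the record lists the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table, differing only in which start codons are permitted. Alternative start codons are not handled here. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (shared by tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    fragment = clean("ATGAACCGCG ATCTGTTGGA CTTTGTGGAT")
    print(translate(fragment))  # MNRDLLDFVD
```

The translated fragment matches the first block of the protein sequence in the record. Passing the full gene sequence through `clean` and then `gc_content` or `translate` would allow the GC content and protein sequence fields to be compared in the same way.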