Gene Mpe_A1922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1922 
Symbol 
ID4786683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2061732 
End bp2063210 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content65% 
IMG OID640090492 
ProductNusA antitermination factor 
Protein accessionYP_001021115 
Protein GI124267111 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0208405 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCG ATCTGTTGGA CTTTGTGGAT GCGATCGCTC GCGAGAAGAG CGTCGAGCGT 
GATGTCGTGT TCGAGGCCGT CGAGGCGGCG CTCGCCTCGG CGAGCAAGAA ACTGCATGGC
GGCGAGGTCG ACATCCGCGT ATCGGTCGAC CGCGACACGG GCGAGTACGA AACGTTTCGC
CGCTGGCTCG TCGTGCCCGA CAGCGCCGGC CTGCAGAACG CCGACGCCGA GGAACTGCTG
ACCGATGCCC GCGACCGCAT CGAAGACATC GAGGAAGGCG ACTACATCGA GGAAGCCATC
GAGTCGGTGT CGATCGGGCG CATCGGTGCG CAGGCCGCCA AGCAGGTGAT CCTGCAGAAG
GTGCGCGACG CCGAGCGCGA GCAGTTGCTC AATGACTTCC TGTCGCGCGG CGACAAGATC
TTCGTCGGCA CCGTCAAGCG CCTCGACAAG GGCGACCTCG TCGTCGAGAG CGGCCGGGTC
GAGGGGCGTC TGAAGCGAAG TGAGCTGATT GCCAAGGAAA ACCTCCGCAC CGGCGACCGC
GTTCGCGCCT ACATCACGGA AGTGGACACC ACGCAACGCG GGCCGCAGAT CATGCTGTCG
CGCAGCGCAC CTGGCTTCAT GGTGGAACTG TTCCGCCACG AGGTTCCGGA GATCGAGCAG
GGCCTGCTCG AGATCAAAAG CTGCGCCCGA GATGCAGGTT CGCGCGCCAA GATCGCCGTG
CTGTCGCACG ACAAGCGGGT CGACCCGATC GGCACCTGCG TGGGTGTGCG CGGCTCGCGC
GTCAATGCCG TCACCAACGA ACTGGCTGGC GAGCGTGTCG ACATCGTGCT GTGGTCGGCC
GATCCGGCGC AGTTCGTGAT CGGCGCGTTG GCGCCGGCCA ATGTGCAGTC GATCGTGGTC
GACGAGGAAA AGCATGCGAT GGACGTGGTG GTCGACGAGG AGAACCTCGC CATCGCCATC
GGCCGCGGCG GCCAGAATGT GCGCCTCGCT TCTGAGCTGA CCGGCTGGCG CATCAACATC
ATGAGCGCCG AGGAGTCGCA GGACAAGCAG GCGACCGAAT CGGAGTCGAT CCGCAAGCTG
TTTGTCGAGA AGCTCGACGT CGATGCCGAG GTGGCCGACA TCCTGATCGC CGAGGGCTTC
ACCAGCCTCG AGGAAGTGGC CTACGTGCCG CTGCAGGAAA TGCTGGAGAT GGAGTCCTTC
GATGAGGACA CCGTCCACGA GTTGCGCACG CGGGCCAAGG ACGCACTGCT GACGATGGAG
ATCGCCCAGG AAGAGAAGCT CGAAAGCGTT TCGCAGGATC TGCGCGACCT CGAGGGGCTC
GACGCCGAGC TGATCGCCAG GCTGGCCGAA GGCGGCATCC ACACGCGCGA CGACCTGGCC
GATCTCGCGG TCGACGAACT GACCGAGCTG ACCGGCGTGG CCGACGAGCA GGCGAAGGCC
CTGATCATGA AGGCGCGCGA GCACTGGTTC ACTGCCTGA
 
Protein sequence
MNRDLLDFVD AIAREKSVER DVVFEAVEAA LASASKKLHG GEVDIRVSVD RDTGEYETFR 
RWLVVPDSAG LQNADAEELL TDARDRIEDI EEGDYIEEAI ESVSIGRIGA QAAKQVILQK
VRDAEREQLL NDFLSRGDKI FVGTVKRLDK GDLVVESGRV EGRLKRSELI AKENLRTGDR
VRAYITEVDT TQRGPQIMLS RSAPGFMVEL FRHEVPEIEQ GLLEIKSCAR DAGSRAKIAV
LSHDKRVDPI GTCVGVRGSR VNAVTNELAG ERVDIVLWSA DPAQFVIGAL APANVQSIVV
DEEKHAMDVV VDEENLAIAI GRGGQNVRLA SELTGWRINI MSAEESQDKQ ATESESIRKL
FVEKLDVDAE VADILIAEGF TSLEEVAYVP LQEMLEMESF DEDTVHELRT RAKDALLTME
IAQEEKLESV SQDLRDLEGL DAELIARLAE GGIHTRDDLA DLAVDELTEL TGVADEQAKA
LIMKAREHWF TA