Gene Mchl_5020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_5020 
Symbol 
ID7115018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp5365863 
End bp5367902 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content70% 
IMG OID643527714 
Productprotein of unknown function UPF0118 
Protein accessionYP_002423713 
Protein GI218532897 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.247687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0740005 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGTA TATCCACCGG TGAGGGCTTC ATCGTGCCGC CGCGCCCGGC GCGTGTTTCG 
GCGGCCGAGA CGCCCAAAGG TCCGCTCGCC TCCTCCCTCG TCGTGTTTGC GATCATCGTC
GCAGGCCTCT TCTTCGCGCG CGAGGTGCTG ATCCCCATCG CGATCGCGGT GCTGCTGTCC
TTCGTGCTCG GCCCGCTGGT GAACTTCCTG CGCCGGCTGA AGCTGGGGCG GGCCTTCGCG
GTGCTGGTCT CGGTACTGCT CGCCGCCGGG ATCATCGCTG CGGTGACCAC CGTCATCGGC
GTGCAGGTCG CCGAACTCGC GCAGGACGTG CCGCGCTACC AGCGCACCGT CGAGCGCAAG
ATCGAGGGGC TGCGGGCCGG CACGCTCGGC CAGACGATGG ACTACATCGC CAACATCAAC
CGGGCGATCC ACCAGAGCGG CGAGGAGAGC AAGGAATCGG CCGAGAAGGC CAAGCAGCAG
GCCGCCCGCG ACAGTAACCG CAAGGCCGAG CCGGAGCCGC CGAAGCCGCT GCTGGTCCAG
GTGGAGGAGC GCCGCCCGGG CCCGCTCGAA CTCGCCACCA CCGTGCTGGC GCCGGTGGCG
CAGCCCCTCG CCACCGCCGG CATCGTCTTC GTCGTCCTCC TGTTCATCCT GATGCAGCGG
GAGGATCTGC GCGACCGCTT GATCCGGCTC GCCGGTTCGA GCGACCTCCA CCGCACGACC
GTGGCGATGG ACGACGCGGC CCGGCGGCTC TCGCGCTACT TCCTGGGCCA GCTCGCCCTC
AACTCCGCCT TCGGCGTCGT CATCGGCGTC GGGCTCTGGA TCATCGGCGT GCCGAGCCCC
GTGCTGTGGG GCATCTTCGC CCTCGTCATG CGCTTCGTGC CCTATATCGG CGCCTTCCTC
TCGGCGGTCC TGCCGATCGC CCTCGCGGCG GCGGTCGATC CGGGCTGGAG CATGGTACTG
TCGACCTTCC TGCTCTTCGC CCTCGTCGAG CCCATCGTCG GGCAGATCAT CGAGCCCCTG
GTCTATGGCC ACTCGACGGG CGTCTCGCCG TTCTCGGTGC TGGTCTCGGC CCTGTTCTGG
ACCTGGCTGT GGGGACCGGT GGGCCTCCTG CTCTCGACCC CGCTCACCGT CTGCCTCGTC
GTGCTTGGGC GTCACGTCGA TCGGCTCGAA TTCCTCGACG TGCTGTTCAG CAACCGCCCG
GCGCTGACGC CGATCGAGAA CTTCTACCAG CGCATGCTCG CCGACGACCC GGAGGAGGCG
CAGGAACATG CCGACCTGAT CCTGCGCGAA TGCTCGCTCT CGGCCTATTA CGACGACGTC
GTGCTGAAGG GTCTCGAACT CGCCTCCCGC GACGCCGCCC GCGGCGTGCT GACGCCGAGC
CAGAAGGACG AGATCCGCGC CTCGATCACG GCTTTGGTGG AGGATCTGGA GGATCGCGAG
GATGCCGTGC CCGATCCGGC CTCCAAACCC ACCCGCCTCT TCGACGGGGC AGGCGCTGCT
GGCGACGGGG AGAGCGCCTG CAAGAACGGG CCGATCGCTG ACCCCGACCC CGACGAGTTG
CCCGCCGCGT GGCAGGCCGA CGGTGCGGTG CTGCTCGTCT CCGGCCGCGG CTTCCTCGAC
GGAGCGGCGA CCGCCATCGC CGACCAACTC CTGCGCAAGC GCGGCTTCGG CACCCGGCAG
GTGCCCTTCG CCGAGGTCGC CCGAGTGCGC ATCGCCGAGT GGCAGCCGGG TCCGGCTCAA
GCGGTCTGCG TGATCTCGCT GGCGCTGTCC GGCGAGCCGA CCCATCTGCG CCGCCTCGTC
TCGCGGCTGC GCCAGAAGAT CGACACCGTG CCGATCGTCG CCGGGCTGTG GCGCCTCGAC
GAGCCGATGC TCGCCGACGA TGCGGTGCGG GCCAAGATCG GCGCGGACGA CCACGTCACC
TCCTTGCGCG ACCTGATCGA GACGATCCTC GCCCTCGCCC GCAACGAGGG CGGCCCGGAC
CGGTCCGAGC GGAGCGATGC GGCCACGCCC CCGCGGCAGG CCCTGCCCGA GCCGGCTTGA
 
Protein sequence
MKRISTGEGF IVPPRPARVS AAETPKGPLA SSLVVFAIIV AGLFFAREVL IPIAIAVLLS 
FVLGPLVNFL RRLKLGRAFA VLVSVLLAAG IIAAVTTVIG VQVAELAQDV PRYQRTVERK
IEGLRAGTLG QTMDYIANIN RAIHQSGEES KESAEKAKQQ AARDSNRKAE PEPPKPLLVQ
VEERRPGPLE LATTVLAPVA QPLATAGIVF VVLLFILMQR EDLRDRLIRL AGSSDLHRTT
VAMDDAARRL SRYFLGQLAL NSAFGVVIGV GLWIIGVPSP VLWGIFALVM RFVPYIGAFL
SAVLPIALAA AVDPGWSMVL STFLLFALVE PIVGQIIEPL VYGHSTGVSP FSVLVSALFW
TWLWGPVGLL LSTPLTVCLV VLGRHVDRLE FLDVLFSNRP ALTPIENFYQ RMLADDPEEA
QEHADLILRE CSLSAYYDDV VLKGLELASR DAARGVLTPS QKDEIRASIT ALVEDLEDRE
DAVPDPASKP TRLFDGAGAA GDGESACKNG PIADPDPDEL PAAWQADGAV LLVSGRGFLD
GAATAIADQL LRKRGFGTRQ VPFAEVARVR IAEWQPGPAQ AVCVISLALS GEPTHLRRLV
SRLRQKIDTV PIVAGLWRLD EPMLADDAVR AKIGADDHVT SLRDLIETIL ALARNEGGPD
RSERSDAATP PRQALPEPA