Gene Mchl_5049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_5049 
Symbol 
ID7118840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp5400013 
End bp5402052 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content72% 
IMG OID643527743 
Producthypothetical protein 
Protein accessionYP_002423742 
Protein GI218532926 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.326897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.417986 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCTCG GCCCGCTGCG GAGCGCGGCG GCGTTCCTCG CCGCCGTCCT CATCGGATCG 
CCGCTCTTCG GCTTCACCGC GGCGGTTCGG GGCGGCGAGG CGAAAGCGCC GCTGGTCCTG
CGGGTTGCGC CCACCGGAGA CAAAGCCCCC CGTCGCGACG CCCGCTTCGC GGATCTGCCG
CGGGCGCTCG CATACGTCGC CGCGCTGCGC CGTCAGGGGG AGGCGCGGGC GATCGTCGTC
GAGCTGGAAC CCGGAACGCA CCGGATCTCG GCGCCCGTCC GGATCGGCCC CCACCATGCC
GGCACGGAAG GGGCGCCCCT GATCCTGCGC GGGGCCGACG ACGGATCGAG CCGGCTCGTC
GGCAGCGTGC CCCTCACGCC CGCATCGCTG CCGCCGCGCC TGCGCGCGCG GTTGCCCGCC
TCGGCCCGCG GCGCAGTGCG CGCCTACCAA TTGCCCGAGG CTTTGCGTGG GGAGCTCGCC
TACCGCGCGC CGCGGCGCCT GCGCGAGACG TACCCGCGCG TCACCGAGAT CTTCGATGCC
GGCGGTGCGC TGCGCCCGGC GCAGTGGCCG AACCCTGGAC CGAACTCCGG CTGGACGACG
GTCGCCGCTG CCGAAGCGGG GGGCCTGGCC TTCACCCTCA AGGACGCGTC GGGCCTGCCC
GATCTGTCCC TGGAGCGCGA CCTGTGGGTG GAGGGCTTCT GGCGCTGGGA CTGGCTGCTG
GAAACGCTTC ACGTGGCGCA GGTCGATCCG CGCCGCCGGC TCGAACTGGA CCAGCCGCCC
TACGAGGGCA TCCGCGACGG CGCCCGGATG CGGCTGGTCC ATGCGCTCGG TGCCCTCGAT
GAACCCGGCG AATGGTGGCG CGACGCCGAG AGCGGCCTGC TGCTGGCCTG GCCGTCTCCC
GGCGCGGACG ACCTCGAAGT CAGCCTCGCC GAGACGCTGA TCCAGGCCGA TGGCGCGCGG
CACCTACGCA TCGAGCGGCT TCGGCTGGAG CGCGCGCGCG GCGATCTGAT CGTCGTGCGG
GGGGGCGAGG ATATCGAGAT CCGCGCGAGC GAACTGGCCT GGGCGGCAGG CCGGGCGGCG
GTGTTCGAGG GGGTGACCGG GGGCGGCGTC TCCGGCAGCA CGGTCCACGA TATCGGCGCG
AGCGCGGTCC GCCTCGTCGG CGGCGACCGC GCCACGCTCC GGCGGGGCGG GCTGTTCGTG
CGCGACACCC GCTTCACCCG CTTCTCGCGG CTGAGCCAGA CCCAGAGTTC CGCGATCGAA
CTCGACGGCG TCGGCGCGGA GGCGAGCGGA AACCTCATCA CCGACGCGAT CGGCTACGCC
ATCTACCTGC GCGGCAACGA CCACGTGTTT CGCGGCAACG AGGTCGCCCG CCTGATCCAC
GGCCTGAGCG ATACCGGCGC CATCTATGCC GGACGCGACT TCACCGCCCG CGGCTCGATC
ATCGAGGACA ATTACGTCCA CGACATCCGC ACCGTGCCTG GCATGGAGGT GAAGGGCGTC
TATCTCGACG ACATGGCGAG CGGCTTCACC ATCCGCCGCA ACCTGTTCGT CGATGTGCAG
CAGCCGGTCT TCATCGGCGG CGGCAACGAC AACACGATCA CCCGCAACGT CTTCGTCGCG
TCGAGCCCGA TGGTCGCTCT CGATGCGCGG GGTCTGACGT GGATGAAGCC ATCGCTGAAC
GAGGCGGATT CGGAGTTCCG GGCCGCCTTC GCCGCGATGC CGCTCGACTC CGCGCCTTGG
CGGATGCGCT ACCCGAAGCT TGCGGAGGCG CTGACCGACG AGCCCGGCGT GGCGCGCAAC
AACCAGATCG TCGATAACGT GAGCATCGGC AGCGACGACC TCGCGTTCAC CGACAAGGCG
GAGGTGGGCC GGCAGATCAT TCTGTTCAAC ACCCGCCTCG ACGGCCCGGT CCCGAATCCC
GGCGACCTCG AGGCGCTGGC CCGCTTCACC GCCGAGCGCG GCATCACGCT TCGCCTCGAC
CCGTCGAAGA TGCGGCGGGA CGGGTTACCC GTCTCGCCGT TCACGGACGC GCGGCGCTGA
 
Protein sequence
MSLGPLRSAA AFLAAVLIGS PLFGFTAAVR GGEAKAPLVL RVAPTGDKAP RRDARFADLP 
RALAYVAALR RQGEARAIVV ELEPGTHRIS APVRIGPHHA GTEGAPLILR GADDGSSRLV
GSVPLTPASL PPRLRARLPA SARGAVRAYQ LPEALRGELA YRAPRRLRET YPRVTEIFDA
GGALRPAQWP NPGPNSGWTT VAAAEAGGLA FTLKDASGLP DLSLERDLWV EGFWRWDWLL
ETLHVAQVDP RRRLELDQPP YEGIRDGARM RLVHALGALD EPGEWWRDAE SGLLLAWPSP
GADDLEVSLA ETLIQADGAR HLRIERLRLE RARGDLIVVR GGEDIEIRAS ELAWAAGRAA
VFEGVTGGGV SGSTVHDIGA SAVRLVGGDR ATLRRGGLFV RDTRFTRFSR LSQTQSSAIE
LDGVGAEASG NLITDAIGYA IYLRGNDHVF RGNEVARLIH GLSDTGAIYA GRDFTARGSI
IEDNYVHDIR TVPGMEVKGV YLDDMASGFT IRRNLFVDVQ QPVFIGGGND NTITRNVFVA
SSPMVALDAR GLTWMKPSLN EADSEFRAAF AAMPLDSAPW RMRYPKLAEA LTDEPGVARN
NQIVDNVSIG SDDLAFTDKA EVGRQIILFN TRLDGPVPNP GDLEALARFT AERGITLRLD
PSKMRRDGLP VSPFTDARR