Gene Mchl_4249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_4249 
Symbol 
ID7117483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp4474725 
End bp4475846 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content47% 
IMG OID643526947 
ProductProtein of unknown function DUF1972 
Protein accessionYP_002422953 
Protein GI218532137 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.444925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0986637 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGCG AGAATGCTAA AAAATTATGT ATATTGGGTA CCCGCGGTAT TCCAGCAAGA 
CATGGAGGAT TTGAGACGTT TGCAGAAAAT TTATCAATCT ATTTAGTAAA GCGGGGTTGG
ACGATATGCG TTTACTGCCA AGGCGAAAAT GGTGGGCCTT CGTTGCGAAA TCACAGAACC
GTGTGGAACG GCGTAAACCT GATCACTATT TATCCTTTCA CCACCGGCGC GCTTTCAACT
ATCGAATTCG ACATCAGATC GATGTTGCAT TGCCTGGGAA GCAAACAAAA GCTGCTTGTT
TTAGGATATA ACACTGCATT TCTATTGATT CTCGCCAAAA TCTTTCGAAG GTTTGTTGCT
ATAAATATGG ACGGCATCGA ATGGAAAAGA CCCAAGTGGT CCAAACCCGC GAAGGCGTGG
TTGTATATGA ATGAGCGCCT TGGTGTCCAC TTGGGTGATC GTGTTGTCGC AGATCATCCA
GCTATTAGCA AATATCTGCA GAGTATTTGT CGCCGCAGTA TCGATATGAT ACCTTATGGA
GCAGAGGAGG TCGTAGCGGC AGACGAGACA ATCTTGGGCA GGATTGGCGG TGAGCCTCAC
AACTATTTTC TGTGCATAGG GAGGATCGAA CCTGAAAATT CCATTCTTGA GATTGTCAAG
GCATACTGCC TGACTGCAAG AAAGGAAAAA CTTATCATTC TTGGAGATTT GAGCAAAGCT
AGTGCCGCCT ACCGAGAAAG TGTTATACGG GCTGCGAATG GGAAGGTTCT GTTCCCGGGT
GCGATTTACG ACAAGGAAGT AGTTTGGACG CTCCGGAAGC TCGCTCTTTG CTATGTTCAT
GGCCATCAGG TTGGAGGAAC AAATCCGTCT CTAGTCGAAA GCCTTGCAGC CGCACGCCCA
GTCTTGGCTC ACCGTAACAA GTTCAATGTT TGGACTGCTG GGAGTAGACA ATTTTACTTC
GACTCGGTGG AAAGCTGCGC AGCAGCAATG ACGGATATAA GCACAAACGG AGAGGCGACA
AGCGTGGCTG CAGCCGCCGC CCTCGAGCAA TTTAGGCAAC ATTTCACCTG GGAAATAGTC
CTTAGAGCGT ACGAAGACAT CTTATACGAT GCTCCATTAT GA
 
Protein sequence
MTGENAKKLC ILGTRGIPAR HGGFETFAEN LSIYLVKRGW TICVYCQGEN GGPSLRNHRT 
VWNGVNLITI YPFTTGALST IEFDIRSMLH CLGSKQKLLV LGYNTAFLLI LAKIFRRFVA
INMDGIEWKR PKWSKPAKAW LYMNERLGVH LGDRVVADHP AISKYLQSIC RRSIDMIPYG
AEEVVAADET ILGRIGGEPH NYFLCIGRIE PENSILEIVK AYCLTARKEK LIILGDLSKA
SAAYRESVIR AANGKVLFPG AIYDKEVVWT LRKLALCYVH GHQVGGTNPS LVESLAAARP
VLAHRNKFNV WTAGSRQFYF DSVESCAAAM TDISTNGEAT SVAAAAALEQ FRQHFTWEIV
LRAYEDILYD APL