Gene Mchl_5666 details


Gene Information

Locus tag: Mchl_5666
Symbol:
ID: 7119015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011758
Strand:
Start bp: 297893
End bp: 299113
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 59%
IMG OID: 643528320
Product: hypothetical protein
Protein accession: YP_002424316
Protein GI: 218533501
COG category: [S] Function unknown
COG ID: [COG3673] Uncharacterized conserved protein
TIGRFAM ID:
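
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 297893
END_BP = 299113
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 299113 - 297893 + 1 gives 1221 bp, and 1221 / 3 - 1 gives 406 aa, matching the listed Gene Length and Protein Length.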


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.171189
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGAAGA ACATCGTCAT TTGCTGTGAC GGCACAGGGA ATGAGATTGG AGAAACAATT 
TCAAATGTTT TGAAACTTTA CAGGATTCTT GAAAAAGACG ATACTCAGCG TGTCTATTAC
AGCTCTGGCA TTGGGACAAT CGGATATAGA AATGCATGGC AGCGGCTAAA GCAGGAAACC
CGCGGGGTAT TTGGCTTGGC CACGGGATAC GGCCTCGATG AGGACGTGCT CGCAGCCTAT
CGCTTTCTCT GCGACGCGTA CGAGGATGGT GACCGGGTCT GGCTTTTCGG CTTCAGCCGG
GGCGCCTACA CCGTTCGCGC ATTGGCCGGG TTCATCAACG TCATCGGGCT GCTGCGTCCA
GATCAAGTCA ACCTGGCAGG ATACGCGTTC GCGGCCTACA AGCAGGTGAG CGTCAGCAAT
AGAGCGCCTG AGTGGCCCGG CGAACCCGCC AAGGTGGATA ACCCCGCCAT GAAGGCGGCT
TGGCAGTTCG CCAGGATCGC GGGCGCCTAT CCTATCAGGA TCGAGTTCAT CGGGGTTTGG
GATACCGTCG CCTCCATAAT TGCCCCACGC AAGGACCGCC TTTTCCCGAC CCTCCAGACG
TTGCCCTACA CGCGCGTGAA CCCCTGCGTG AAAGCCTTCA GGCAGGCGAT CGCGATCGAC
GAATGCAGAC GCATGTTCCG CCTGAACAGG TGGGGCGACA GACAGCCATT CCGGAAGGAT
CCCTACGAGC GGGCCTCGGT GGCCGAACAG GACGTGCGCC AGGTCTGGTT CGCCGGTGTC
CATGCCGACA TCGGCGGCGG ATATCCCGAA GCGCAGAGCG GCATCTCGAA ATTCCCGCTG
CTCTGGATGG TCAGGCACGC GCAGGCGAAA GGCCTGCGAT GCGACCGCCG CCTCATTGAT
CATTTGGTGT TGGGTATATC ATCCGACGAC GCCTCCACCG ATTACGTTCC ACCGGACGTG
CGGGGAAAGA CCCACGACTC AATGAATGCC GGTTGGCGCA TCCTTGAGTG GCTACCGAAA
AAAGCCAGAT GGCGCGAGTG GCCAGCCCGC AAGGTGGTCG CCGGTTTCTA TCTTCCGCGC
GCCGAGCCAC GTCGGATCCC GGACGGGGCC CGAATTCACT GGTCCGTCTT CCAGAAAATG
GAGCAGGACC CAGGATACAA GCCGCTCAAC CTGCCGGACT CGCACCTTAG AGAGGAGGAT
CTCGCTCCGC CGGAGTTATA G
 
Protein sequence
MPKNIVICCD GTGNEIGETI SNVLKLYRIL EKDDTQRVYY SSGIGTIGYR NAWQRLKQET 
RGVFGLATGY GLDEDVLAAY RFLCDAYEDG DRVWLFGFSR GAYTVRALAG FINVIGLLRP
DQVNLAGYAF AAYKQVSVSN RAPEWPGEPA KVDNPAMKAA WQFARIAGAY PIRIEFIGVW
DTVASIIAPR KDRLFPTLQT LPYTRVNPCV KAFRQAIAID ECRRMFRLNR WGDRQPFRKD
PYERASVAEQ DVRQVWFAGV HADIGGGYPE AQSGISKFPL LWMVRHAQAK GLRCDRRLID
HLVLGISSDD ASTDYVPPDV RGKTHDSMNA GWRILEWLPK KARWREWPAR KVVAGFYLPR
AEPRRIPDGA RIHWSVFQKM EQDPGYKPLN LPDSHLREED LAPPEL
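
The sequences above are printed in blocks of ten characters separated by spaces, so the whitespace has to be removed before they can be used in analysis. The sketch below, in Python, shows one way to do that and to run a few basic sanity checks on the gene sequence. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the start codon set is an assumption covering only the common bacterial start codons, not every start permitted by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}    # stop codons in translation table 11
COMMON_STARTS = {"ATG", "GTG", "TTG"}  # common bacterial starts (assumed, not exhaustive)


def clean_sequence(text: str) -> str:
    """Drop all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, with a common start and a final stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # First and last lines of the gene sequence above, for illustration only.
    first = clean_sequence(
        "ATGCCGAAGA ACATCGTCAT TTGCTGTGAC GGCACAGGGA ATGAGATTGG AGAAACAATT "
    )
    last = clean_sequence("CTCGCTCCGC CGGAGTTATA G")
    print(len(first), len(last))
    print(first[:3], last[-3:])
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives values that can be compared with the Gene Length and GC content fields reported in the record.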