Gene Mchl_4586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_4586 
Symbol 
ID7117982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp4860542 
End bp4861903 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content72% 
IMG OID643527285 
Productpentapeptide repeat protein 
Protein accessionYP_002423289 
Protein GI218532473 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.20747 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0812819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGACAGG GCGATGCGGA GGGCGATGAT CCGCGCGTCG CAGCGCTGCT CGCGCGTCTC 
TCGGACGACG AGGTGCCGTT CTCGATCGCC GGGCGCGACG ACCTGGCCGG ACTCACCCTG
TCGCGCGCCG CGCTGAACGG TCACATCGAT CCGACGTCGC CGCCGCGCTG GTGGATGGAG
GGCGGGGGCG GGCTCGACCT CGCCGGCGCC GACCTCGCCG ACGCCCGGCT GGAGATGACC
GACTTTTCCG ACGCCAACCT GCGTCGCGCC TCGCTCGCCG GAGCCCTCGC GCGCTCGGCC
GGCTTCGCGA ATGCCTGCCT GGAGGAAGCG GACTTTGCCG GCGCCGACCT CAGCGGCGCG
CGCTTTACCG GAATTGCCGG CGGGCAGGCC TCCTTCCGCG AGGCGATGCT GGAGGATGCC
GACTTCTCCG ACGCCACCAT GCGCTTTGCC CGGCTCGACA AGGCTTTGCT CGACGGCGCC
CGCTTCGAGG GCGCCGACCT CTGGGGCACC GACTTCACCG GGGCGGATGC CGACGATTCC
GTGTTCCGAA AGGCCCGGCT CGACGAGGCC AACCTCTCCG ACTGCAACCT GACCGGCGCG
GACTTCGAGG GGGCGAGCCT GAAGAAGGCG CGGCTCGTCG GCTCGCGGCT GCGCGGCGCC
AACTTCTCCG GGGCCCACCT CGACGGGGCG GACCTGTCGG GGGCCGACTT CTCCCGCACC
AGCCTCGTGC GGCTCGACCT CACGACGTGC AAGCTGCACC GCGCGCGCTT TGCCGGCGCG
TGGCTGGAAG GCGTGCGGCT CTCCGTCGAG CAGATCGGCG GGATGGTCGG CGAGGAGGCG
GCGGGCGAGT ACGAGGCGGC GCAGGCGAGC TATCTCGCGC TCGAGCGCAA CCTTCAGAGC
ATCGGCAGCC CCGAAGGCGC GAGCTGGGCC TACAAGCGCG GGCGCCGCAT GGGCCGCCGC
CATGCCGGCG TGCGGGCCCG CGAGGCCTTT TTCGCCCGCG ATGTGCGGGG AACGCTGAGC
TCCGGTTACC GCTGGATCGC CGACCGCTTC GTCGAGTGGC TGTGCGACTA CGGCGAGAGC
CTGTCGCGGA TCGCTCGCGC CTTCCTCGTC GGGATCTTCC TGTTCGCCGG GGCCTACGGC
GCGACGGGCG GGCTCTTCCA CGAGGGCGAG AACGCGCCGA CCTACAACCC GCTCGATCTC
GTGAGCTACA GCGCGCTCAA CATGATGACC GCCAACCCGC CCGAGATCGG GGTGAAGCCG
CTGGGCCGTG TCACCAACCT GCTGGTCGGG TTGCAGGGGG CGGCGGGGAT CGTGCTGATG
GGGCTGTTCG GCTTCGTCCT CGGCAACCGC CTGCGCCGCT GA
 
Protein sequence
MRQGDAEGDD PRVAALLARL SDDEVPFSIA GRDDLAGLTL SRAALNGHID PTSPPRWWME 
GGGGLDLAGA DLADARLEMT DFSDANLRRA SLAGALARSA GFANACLEEA DFAGADLSGA
RFTGIAGGQA SFREAMLEDA DFSDATMRFA RLDKALLDGA RFEGADLWGT DFTGADADDS
VFRKARLDEA NLSDCNLTGA DFEGASLKKA RLVGSRLRGA NFSGAHLDGA DLSGADFSRT
SLVRLDLTTC KLHRARFAGA WLEGVRLSVE QIGGMVGEEA AGEYEAAQAS YLALERNLQS
IGSPEGASWA YKRGRRMGRR HAGVRAREAF FARDVRGTLS SGYRWIADRF VEWLCDYGES
LSRIARAFLV GIFLFAGAYG ATGGLFHEGE NAPTYNPLDL VSYSALNMMT ANPPEIGVKP
LGRVTNLLVG LQGAAGIVLM GLFGFVLGNR LRR