Gene M446_2072 details


Gene Information

Locus tag: M446_2072
Symbol:
ID: 6134431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 2312855
End bp: 2314015
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 75%
IMG OID: 641642301
Product: hypothetical protein
Protein accession: YP_001768969
Protein GI: 170740314
COG category: [S] Function unknown
COG ID: [COG4641] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
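
The coordinates and lengths listed in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record states neither assumption, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2312855
end_bp = 2314015

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, so it adds no residue
# to the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1161 386
```

Under these assumptions the computed values match the reported gene length of 1161 bp and protein length of 386 aa.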


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.236468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0105803
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCGCC CCCTCGACCT CGTCGTCCTC GGCCTCAGCC TGTCCTCGTC CTGGGGCAAC 
GGCCACGCCA CCACCTACCG GGCGCTGCTG CGGGCCTTCG CGGCGCGCGG CCACCGCGTC
ACCTTCCTGG AACGGGACGT GCCCTGGTAC GCGGCCCATC GCGACCTCGC CGCGCCGGAT
TACTGCGACC TCGTCCTCTA TCCCGACCTC GCGGCCCTGC GCGACCTGCG CCCGCGCCTG
CTGCGGGCGG ACGCGGTGAT GGTCGGCTCC TACGTGCCGG AGGGCGTCGC GGTCGGCGCC
CTGGCGGTCG CGACGATGCG GGAGGCGGGG GGCGTCGCCG CCTTCTACGA CATCGACACG
CCGGTGACGC TCGCCAAGCT CGCCCGGGGC GACCACGAGT ACCTCACCCC CGACCTGATC
CGCGCCTACG ACCTCTACCT CTCCTTCACG GGCGGGCCGG TGCTGGAGCG CCTGGAGCGG
GAATTCGGCG CGCCCCGCGC CCGCGCCCTC TACTGCTCGG TCGATCCCGC CCTCTACGCG
CCGACCGGCG CGGAGCCGGT CTACGACCTC TCCTATCTCG GCACCTACAG CCCGGACCGG
CAGCCGACCC TGGAGCGGCT CCTGATCGAG CCCGCGCGGC GGGCGCCCGA GCTGCGCTTC
GTGGTCGCCG GGCCGCAATA TCCCGCCGAC ATCGCCTGGC CGCCGAACGT CGAGCGGCGC
GACCACGTCG GCCCCGCCGA TCACCCGGCC TTCTACGGCC TGAGCCGCTG GACCCTGAAC
GTCACCCGCG CCGACATGCG CGCGGCCGGC TACAGCCCGA GCGTCCGCCT GTTCGAGGCC
GCCGCCTGCG GCACGCCGAT CCTCTCGGAC GACTGGCCGG GCCTCGGCAC GATCCTGGCG
CCGGGCCGCG AGATCGTGGT GGCCGAGGGC CCCGACGCGG TGCTGTCGGC GCTCACCCGG
ACGAGTCCGG CCGAGCGCGC CGCCCTGGCG CAGGCGGCCC GCCGCCGGGT GCTGGCCCGG
CACAGCGCCG CGCAGCGGGC CCAGGAACTC GAGGCGGCGC TCCTCGAGGC GGCGCTGCGC
GAGGCGGCGG CGCCTTCGCC CAAATACTCG CATGAAGTAT CGAAACTCCC GCTTGCCGAG
GGCGTTAGGG GGCGGAGCTA A
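
The GC content reported for this gene can be recomputed from the nucleotide sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown here; the function name `gc_fraction` is hypothetical and not part of any database tool.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence contains no bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, used only as a short demonstration.
first_line = "GTGACCCGCC CCCTCGACCT CGTCGTCCTC GGCCTCAGCC TGTCCTCGTC CTGGGGCAAC"
print(f"{gc_fraction(first_line):.2%}")
```

Passing the full 1161-base sequence instead of the first line gives the figure to compare against the reported GC content.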
 
Protein sequence
MTRPLDLVVL GLSLSSSWGN GHATTYRALL RAFAARGHRV TFLERDVPWY AAHRDLAAPD 
YCDLVLYPDL AALRDLRPRL LRADAVMVGS YVPEGVAVGA LAVATMREAG GVAAFYDIDT
PVTLAKLARG DHEYLTPDLI RAYDLYLSFT GGPVLERLER EFGAPRARAL YCSVDPALYA
PTGAEPVYDL SYLGTYSPDR QPTLERLLIE PARRAPELRF VVAGPQYPAD IAWPPNVERR
DHVGPADHPA FYGLSRWTLN VTRADMRAAG YSPSVRLFEA AACGTPILSD DWPGLGTILA
PGREIVVAEG PDAVLSALTR TSPAERAALA QAARRRVLAR HSAAQRAQEL EAALLEAALR
EAAAPSPKYS HEVSKLPLAE GVRGRS
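
The protein sequence follows from the gene sequence under translation table 11, in which GTG can serve as an initiation codon and is then read as methionine. The sketch below is a minimal hand-rolled translator, not the method the database used; the names `translate`, `CODON_TABLE` and `START_CODONS` are illustrative. It assumes the input is already in the coding orientation and in frame.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Table 11 shares these
# assignments with the standard code and differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of table 11; in first position each is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(sequence: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGACCCGCC CCCTCGACCT CGTCGTCCTC"))  # prints: MTRPLDLVVL
```

The output matches the first ten residues of the protein sequence, and the final seven codons of the gene (GGC GTT AGG GGG CGG AGC TAA) give the terminal GVRGRS followed by a stop.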