Gene M446_4052 details


Gene Information

Locus tag: M446_4052
Symbol:
ID: 6135844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4519178
End bp: 4520428
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 67%
IMG OID: 641644207
Product: formyl-coenzyme A transferase
Protein accession: YP_001770847
Protein GI: 170742192
COG category: [C] Energy production and conversion
COG ID: [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase
TIGRFAM ID: [TIGR03253] formyl-CoA transferase
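
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The short Python check below is a sketch that assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 4519178
end_bp = 4520428

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 1251 416
```

Both results agree with the Gene Length and Protein Length fields listed above.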


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.739861
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGC CCCTCGAAGG CATCAAGATC ATCGACTTCA CGCACGTCCA GGCCGGGCCG 
GCCTGCACCC AGCTCCTCGC CTGGTTCGGC GCGGACGTGA TCAAGGTGGA GCGGCCCGGC
GCAGGCGACG TCACCCGCAC CCAGCTGCGG CACGTCGAGG ATGCGGACGC GCTCTACTTC
ACGATGCTGA ACTCCAACAA GCGCTCGCTC ACCCTCGACA CCAAAACCCC GCAGGGCAAG
GAAGTCCTGG AGAAGCTGAT CAAGGAATCC GACGTCCTCG TCGAGAATTT CGGCCCCGGC
GCCCTCGACC GCATGGGCTT CTCCTGGGCC CGGATCAACG AGCTCAACCC GGGCATGATC
GTCGCCTCGG TGAAGGGCTT CAGCGAGGGC CACCACTACG AGGACCTGAA GGTCTACGAG
AACGTGGCGC AGTGCGCGGG CGGCGCGGCC TCGACGACCG GCTTCTGGGA CGGCCCCCCG
ACCGTGAGCG GCGCGGCGCT CGGTGATTCG AACACCGGCA TGCACCTCGC GATCGGCATC
CTCACGGCGC TGCACGCCCG CAACAAGACC GGCAAGGGCC AGAAGGTGGC CGTGTCGATG
CAGGACGCGG TGCTCAACCT CTGCCGCGTC AAGCTGCGCG ACCAGCAGCG CCTCGACGCG
CTCGGCTACC TGGAAGAGTA CCCGCAATAC CCGCACGGCG AGTTCAGCGA CGCGGTGCCG
CGCGGCGGCA ACGCGGGCGG CGGCGGCCAG CCCGGCTGGG TGCTGAAGTG CAAGGGCTGG
GAGACCGATC CCAACGCCTA CATCTACTTC ACGATCCAGG GCCACGCCTG GGCGCCGATC
TGCCGCGCCC TCGGCAAGGA GGAGTGGATC GAGGATCCGG CCTACAACAC CGCCCGCGCC
CGCCAGGACA AGATCTTCGA GATCTTCGCC TTCATCGAGA GCTGGCTGGC CGACAAGACC
AAGTACGAGG CCGTGGACAT CCTGCGCAAG TTCGACATCC CCTGCGCGCC GGTGCTGTCC
ATGAAGGAGA TCGCCGCGGA CAAGTCGTTG CGGGCGAGCG GCTCGATCGT CGAGGTGCAG
CACCCGCAGC TCGGCAAGTA CCTGACCGTC GGCAGCCCGA TCAAGTTCTC GGACCTGAAG
GTCGAGGTCA AGGCGTCGCC GCTCCTCGGC GAGCACACCG ACGAGGTGCT GCGCGACCTC
GGCTACACCG AGCAGCAGAT CGAGATGCTC CACCAGGAGC GCGCGGTCTA A
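
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before computing anything from it. The Python sketch below shows one way to do that and to compute a GC percentage; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first display row is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First display row of the gene sequence above.
# Paste all rows instead to check the whole gene against the GC content field.
first_row = clean_sequence(
    "ATGAGCAAGC CCCTCGAAGG CATCAAGATC ATCGACTTCA CGCACGTCCA GGCCGGGCCG"
)
print(len(first_row), round(gc_percent(first_row), 1))
```

A single row is not expected to match the gene-wide GC content value; the comparison is only meaningful on the full sequence.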
 
Protein sequence
MSKPLEGIKI IDFTHVQAGP ACTQLLAWFG ADVIKVERPG AGDVTRTQLR HVEDADALYF 
TMLNSNKRSL TLDTKTPQGK EVLEKLIKES DVLVENFGPG ALDRMGFSWA RINELNPGMI
VASVKGFSEG HHYEDLKVYE NVAQCAGGAA STTGFWDGPP TVSGAALGDS NTGMHLAIGI
LTALHARNKT GKGQKVAVSM QDAVLNLCRV KLRDQQRLDA LGYLEEYPQY PHGEFSDAVP
RGGNAGGGGQ PGWVLKCKGW ETDPNAYIYF TIQGHAWAPI CRALGKEEWI EDPAYNTARA
RQDKIFEIFA FIESWLADKT KYEAVDILRK FDIPCAPVLS MKEIAADKSL RASGSIVEVQ
HPQLGKYLTV GSPIKFSDLK VEVKASPLLG EHTDEVLRDL GYTEQQIEML HQERAV
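
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The Python sketch below builds the standard codon assignments from a compact 64-letter string; translation table 11 uses the same codon-to-amino-acid assignments and differs in its set of permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # remove display spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCAAGC CCCTCGAAGG CATCAAGATC"))  # MSKPLEGIKI
```

The result matches the first ten residues of the protein sequence; passing the full gene sequence should reproduce all 416 residues if the record is internally consistent.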