Gene Msil_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1023 
Symbol 
ID7091851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1109882 
End bp1111066 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content64% 
IMG OID643464362 
Producthomoserine O-acetyltransferase 
Protein accessionYP_002361354 
Protein GI217977207 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.00432777 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGACGATCG TCTTGACCCA TAAGCTCAGC GGCCTGCGCG AAGTCGACGC GCCGCATAGT 
CTCGTCGCGC ATTTCGGACC AGATCACGCG CTGCAGATGG ATTCGGGCGG CCGGCTCAAT
CAATGGACGA TCGCCTATCA GACCTATGGC GAACTCAACG CCGCCAAATC CAACGCCATT
CTCGTCTGCC ATGCTCTGAC CGGCGATCAG CATGTCGCCA ACGCGCATCC GGTAACGGGC
AAGCCCGGCT GGTGGAGCAC CATGGTCGGT CCCGGCCGGC CGATCGACAC CGATCGCTAT
TTCGTCATCT GCTCGAATGT GATCGGCGGC TGCATGGGCA CGACCGGTCC GGCCTCGCTC
AATCCGCAGA CAGGCCGGCC GTACGGGCTT GAGCTGCCGA TCGTGACGAT CCGCGACATG
GTCCGGGCGC AGGCGATGCT GATCGACCAC CTTGGCGTCG ATACGCTGTT TTGCGTCGTC
GGGGGCTCGA TGGGCGGCAT GCAGGTGCTG CAATGGGTTG CGAGCTTCCC CGAGCGCGTC
TTCTCGGCCA TGCCGATCGC CACGGCGGCG AAACATTCCT CGCAAAACAT CGCCTTTCAC
GAGGTCGGCC GGCAGGCCGT GATGGCCGAT CCCGACTGGC GCAAGGGCCG CTATCTCGAG
GAAGGGGTCA TCCCCACCAA AGGCCTCGCC GTCGCCCGCA TGGCGGCGCA TATCACCTAT
CTGTCCGACG AGGCGCTGCA GAGCAAATTT GGCCGCAAGC TACAGGACCG CGACGCGCCG
ACCTTCTCCT TCGACGCCGA ATTCCAGATC GAGAATTATC TGCGCTATCA GGGCTCGAGC
TTCGTCGACC GGTTCGATCC GAACTCCTAT CTTTATGTGA CCCGAGCTTG CGACTATTTC
GACCTGGCCG CCGACTACGA CGGATCGCTG GCGCGCGCCT TTCAGGGGGT CAAGGCGCGC
TTTTGCGTCG TCTCGTTCAA TTCCGACTGG CTCTATCCGA CCGCCGCCTC GCGCGCCATC
GTGCACGCCC TGAACGCCGG GGGCGCCTCG GTCTCCTTCG TCGACATCGA GACCGATCGC
GGCCACGACG CCTTTCTGCT CGACCTGCCG GAGTTCATCG CCACCTCGCA GGGCTTTCTC
GATTCGGCCG CCAAGGCTCG CGGCCTGCCG CCGGCCGCGC CTTGA
 
Protein sequence
MTIVLTHKLS GLREVDAPHS LVAHFGPDHA LQMDSGGRLN QWTIAYQTYG ELNAAKSNAI 
LVCHALTGDQ HVANAHPVTG KPGWWSTMVG PGRPIDTDRY FVICSNVIGG CMGTTGPASL
NPQTGRPYGL ELPIVTIRDM VRAQAMLIDH LGVDTLFCVV GGSMGGMQVL QWVASFPERV
FSAMPIATAA KHSSQNIAFH EVGRQAVMAD PDWRKGRYLE EGVIPTKGLA VARMAAHITY
LSDEALQSKF GRKLQDRDAP TFSFDAEFQI ENYLRYQGSS FVDRFDPNSY LYVTRACDYF
DLAADYDGSL ARAFQGVKAR FCVVSFNSDW LYPTAASRAI VHALNAGGAS VSFVDIETDR
GHDAFLLDLP EFIATSQGFL DSAAKARGLP PAAP