Gene Mboo_2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2043 
Symbol 
ID5411170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2119131 
End bp2120606 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content59% 
IMG OID640869285 
Producthomoserine O-acetyltransferase 
Protein accessionYP_001405200 
Protein GI154151582 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG2021] Homoserine acetyltransferase
[COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.633415 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCGTG GCTCGCTAGG CATCGTTACG ACCCAGTACG CAGACCTCCC CGGTCCCTTT 
ACCCTGGAGA GCGGGGCGGT GCTGCCCGAA ATAAGGATTG CCTACGAGAC CTACGGCAGG
CTCAACAAGG AAAAGAGCAA CGCGATCCTT CTCTGCCATG CCCTCTCGGG CGACGCTCAC
GTAGCCGGGT TCCATAACGG GGAAACAAAA CCCGGCTGGT GGGACGCAGT GGTAGGGCCG
GGAAAGGCGT TTGATACCGA GCGTTACTTT GTTATCTGCA GTAACGTGCT TGGGGGGTGT
AAGGGCTCGA CCGGCCCTTC CACCATCAAC CCCGAAACGG GCAAACCCTA TGGCGCAACC
TTCCCGGTGG TGACCATCCG GGACATGGTT AACGCCCAGA AACTTCTTCT TGACAGCCTT
GGTATTCCCG AACTCTATGC GGTTGCAGGC GGCTCGATGG GGGGGATGCA GGCGCTCCAG
TGGACTGTCT CCTACCCGGA TCTTATAAAA AAGGCGGTCA TCATCGCCAC GACCGGCTAC
TCCACCCCCC AGCAGATCGC CTTTAACGAG GTGGGCCGGA AGGCGATTCT CTCCGATCCC
GACTGGAGTG GCGGGGACTA TTACGGGAAA AAGACCCCTG CCCATGGCCT CGCCCTTGCC
CGGATGGTGG GCCACATCAC CTACCTCTCC GATGAATCGA TGCACGCGAA GTTCGGGCGC
TCGCTCCAGG GAAAGGCGCA GGTGGGCTTT GATTTCTCCA CCGAGTTTGC CATCGAGAGC
TACCTCCACC ACCAGGGTGA TACGTTCACA AAAAGGTTCG ATGCAAACTC GTATCTCTAC
ATCACCAAGG CCATCGATTA CTTTGATCTC ACCAAAGACG GGTCCCTGAC CACCGGCCTT
GCAGCGGCAA AGGCTGCGTT CTTTGTTATC TCCGTTACCT CGGACTGGCT GTACCCTCCT
TACCAGTCGC AGGAGATTGT TACCGCCCTT ACCACGAACG AGCGCGAGGT ACAGTACTGC
GAGATCCGGT CCAACTACGG CCATGATGCG TTCCTCCTTG AATCCGGGCA GCTCAACTAC
CTGATCTCCC GGTTCCTCTC CCATACTGTC GTAGGTGACG TGATGGCGAG GAACGTGGAG
TGCATCGAGG AAGGTACCAC GATCGCGGTT ACCGCCCGGC GGATGATCAC CAGTGGTGTC
AATCACCTGC CGGTTCTCTC CCCTGCCGGC CAGCTCGTGG GAATCGTGAC CTCGTGGGAC
ATTGCAAAGG CGGTGGCTTC CAATTTCCTG TGGCTTGACG AGATTATGAG CCGGAACGTG
GTCACGACCA CCGAGAACGA GCCCGTTGAC GAGGCGGCGC GCAAGATGGA GGCCCACTCG
ATCTCCGCGC TTCCGGTGAT CGATGGCGAC TCGCACGTGA TCGGTCTTAT TACAAGCGAT
GCGATCAGTA CCCTTGTAGG GAGACAGAAC CCGTGA
 
Protein sequence
MLRGSLGIVT TQYADLPGPF TLESGAVLPE IRIAYETYGR LNKEKSNAIL LCHALSGDAH 
VAGFHNGETK PGWWDAVVGP GKAFDTERYF VICSNVLGGC KGSTGPSTIN PETGKPYGAT
FPVVTIRDMV NAQKLLLDSL GIPELYAVAG GSMGGMQALQ WTVSYPDLIK KAVIIATTGY
STPQQIAFNE VGRKAILSDP DWSGGDYYGK KTPAHGLALA RMVGHITYLS DESMHAKFGR
SLQGKAQVGF DFSTEFAIES YLHHQGDTFT KRFDANSYLY ITKAIDYFDL TKDGSLTTGL
AAAKAAFFVI SVTSDWLYPP YQSQEIVTAL TTNEREVQYC EIRSNYGHDA FLLESGQLNY
LISRFLSHTV VGDVMARNVE CIEEGTTIAV TARRMITSGV NHLPVLSPAG QLVGIVTSWD
IAKAVASNFL WLDEIMSRNV VTTTENEPVD EAARKMEAHS ISALPVIDGD SHVIGLITSD
AISTLVGRQN P