Gene Mboo_1354 details

Gene Information

Locus tag: Mboo_1354
Symbol:
ID: 5410600
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1379175
End bp: 1380161
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 56%
IMG OID: 640868586
Product: homoserine dehydrogenase
Protein accession: YP_001404515
Protein GI: 154150897
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
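
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above; the variable names are illustrative.
start_bp = 1379175
end_bp = 1380161

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.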


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0271184
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.0214619
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCGG CGCTGATCGG GCTTGGCTCG GTGGGACGCG GTGTTCTTGA AATACTTGCC 
AACAAAAATC TCGGTATCAC TATCACCGGG ATCGCCGATT CCAAGAGCGG GTGCATCGAT
AACGCCGGCA TTGATCCTGA GGTTGTGCTT AAGGAAAAGC AGAGGACCGG CTTGTGCGGT
GACCGCAGGA TCGATGCGGC TGCGGTGATC AGGAACGCGG ACTATGAGGT TCTTATCGAA
GTTACCCCGA CCAATGCCCT GACCGGAGAA CCGGCTCTTG GGTACATACG GGCAGCCCTG
GCACGAAAGA AGCACATTGT CACCTCCAAC AAAGGCCCGA TTGCCCTTGC TTACCGCGAT
CTTGCGGGGC TTGCACAGAA GAAAGAAGTG GCGCTCCGGT ACGAGGCTAC GGTTGGCGGG
GCAATCCCGA TCATGCATAC ACTCCAAGAC GGCCTGTGCG GGAACAGGAT TGTCGCGGTC
CATGGGGTTC TCAACGGAAC CTGCAATTAC ATCCTTACCC GTATGGCTGC CGAGGGACTC
ACCTACGAAC AGGCACTGCT GGAGGCTCGG GAGATGGGAT ATGCCGAGGC CGATCCCACC
TACGATGTAA AAGGGATCGA TGCTGCTATA AAACTCGTCA TCCTAGCAAA TACGGTCTGG
GACAATGGTG TCACGCTTGC CGATATTGAT ATCACCGGCA TCGACCTCCT CACCCCGGAC
GCCCTGCGCT TGGCTGAGGA AGGGGACAGC ACCATCCGCC TGATCGCTGA GGCCATCCCG
GATAAGAAAA TATTCCGGGT CTCGCCGCGC ATGATCGAAA AAAGCCACCC TCTCGTAGTC
GAAGGATCGC TGAATGCGCT CACCCTCGAG ACCGACATGG CAAAGGAGAT CACGCTGATT
GGAAAAGGTG CCGGATCGAT CGAGACGGCG AGTGCGATTA TCGGAGATAT CCTGTATATC
CGCGACCATT ATGGCAAGCG TGCTTGA
 
Protein sequence
MKAALIGLGS VGRGVLEILA NKNLGITITG IADSKSGCID NAGIDPEVVL KEKQRTGLCG 
DRRIDAAAVI RNADYEVLIE VTPTNALTGE PALGYIRAAL ARKKHIVTSN KGPIALAYRD
LAGLAQKKEV ALRYEATVGG AIPIMHTLQD GLCGNRIVAV HGVLNGTCNY ILTRMAAEGL
TYEQALLEAR EMGYAEADPT YDVKGIDAAI KLVILANTVW DNGVTLADID ITGIDLLTPD
ALRLAEEGDS TIRLIAEAIP DKKIFRVSPR MIEKSHPLVV EGSLNALTLE TDMAKEITLI
GKGAGSIETA SAIIGDILYI RDHYGKRA
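
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence might be cleaned, its GC percentage computed, and its translation compared with the listed protein. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. The codon table is the standard one, which makes the same codon-to-amino-acid assignments as translation table 11.

```python
# Standard codon table in NCBI order (T, C, A, G for each codon position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked):
    """Join a blocked sequence by dropping spaces and line breaks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above, used here as a short example.
first_line = "ATGAAGGCGG CGCTGATCGG GCTTGGCTCG GTGGGACGCG GTGTTCTTGA AATACTTGCC"
dna = clean(first_line)
print(len(dna), translate(dna), round(gc_percent(dna), 1))
```

Passing the full gene sequence through `clean` and `translate` in the same way gives a string that can be compared directly with the cleaned protein sequence.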