Gene Mbar_A2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A2049 
Symbol 
ID3625564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp2590508 
End bp2591725 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content47% 
IMG OID637700927 
Productthreonine synthase 
Protein accessionYP_305563 
Protein GI73669548 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.381154 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCATC TCAAATGTAT CGAATGCGGT GCAGAGTATT CCAAAGACGA AGTAATCTAT 
ACATGCAGCA AATGTGATGG GCTGCTTGAT GTAATTTATG ATTATTCCTC AGTAAAAATT
GACATGGAAA AACTGAAGAC TGAATGCCCT TCGGTCTGGA AATATGCAAA ACTTCTCCCT
ATTGAAAGAG AGCCTGTAAC TATCCAGGAA GGCGGAACTC CCCTTTACAA ATGCAACCGT
CTGGCTGAGA AAATCGGAAT TAAAGAACTC TACGTGAAGC ACGAAGGCAT GAACCCTACA
GGCTCTTTTA AGGACAGAGG AATGACTGTA GGAGTTACGA AAGCACTTGA ATTGGGAATG
AACACCGTTG CCTGTGCGTC TACTGGAAAT ACCTCGGCAG CCCTTGCAAT CTATGGGGCA
AAAGCCGGAA TTCCTGTAAT CGTACTTTTG CCTGCAGGAA AAGTTGCTCT TGGAAAAGTA
GCCCAGGCCC TTATGCACGG AGCAAAGGTC CTCAGCATTC GTGGAAATTT TGACGACGCT
CTCGCCCTTG TGCGCACTCT CTGTTCCCAG GAAAAAATCT ATCTCTTAAA CTCGATCAAC
CCCTACAGGC TGGAAGGCCA GAAGACTATC GGTTTTGAAA TTGCAGACCA GCTCGGTTTC
AAAGTACCTG ACAGAGTTGT CCTGCCTGTA GGAAATGCAG GAAATATCAC AGCTATCTGG
AAGGGTTTCA GGGAGTTTAA GAAGCTCGGC ATAACGGATT CGCTCCCGAA GATGACCGGG
ATTCAGGCCG CAGGCTCCTG TCCAATTGTA ACAGCTATAA AGAGCGAGGC TCCTGAAATC
ACCCCTGAGG AAAAACCCGA AACCGTTGCA ACAGCAATCA GGATAGGAAA CCCTGTTAAC
GCTAAAAAGG CTCTTGCTGC CATCCGGGAA TCCGGTGGGA CTGCGGAATC CGTTACTGAC
GAAGAAATCC TTACAGCCCA GAAAGACCTT GCAAGGCTTG AAGGAATAGG TGTCGAACCT
GCAAGTGCAG CTTCGGTTGC AGGGCTTAAG AAACTTGTTG ATATGGGTGT TATAAGCAGA
GACGAGACCG TTGTCTGTAT TACTACAGGA CACCTGCTTA AAGACCCGCA GACTGTAATT
GACATCTGTG AAAAACCTAT TGTTGTGGAT GCCAGTATAG AAGCCATCCG GGAAGCTATC
TTCGGAAAGG CAGAATAA
 
Protein sequence
MYHLKCIECG AEYSKDEVIY TCSKCDGLLD VIYDYSSVKI DMEKLKTECP SVWKYAKLLP 
IEREPVTIQE GGTPLYKCNR LAEKIGIKEL YVKHEGMNPT GSFKDRGMTV GVTKALELGM
NTVACASTGN TSAALAIYGA KAGIPVIVLL PAGKVALGKV AQALMHGAKV LSIRGNFDDA
LALVRTLCSQ EKIYLLNSIN PYRLEGQKTI GFEIADQLGF KVPDRVVLPV GNAGNITAIW
KGFREFKKLG ITDSLPKMTG IQAAGSCPIV TAIKSEAPEI TPEEKPETVA TAIRIGNPVN
AKKALAAIRE SGGTAESVTD EEILTAQKDL ARLEGIGVEP ASAASVAGLK KLVDMGVISR
DETVVCITTG HLLKDPQTVI DICEKPIVVD ASIEAIREAI FGKAE