Gene Mbar_A3541 details

Gene Information

Locus tag: Mbar_A3541
Symbol:
ID: 3627563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 4542771
End bp: 4544021
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 47%
IMG OID: 637702367
Product: L-threonine synthase
Protein accession: YP_306990
Protein GI: 73670975
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0498] Threonine synthase
TIGRFAM ID:
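
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon (the gene sequence in this record does end in TAG). The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 4542771
end_bp = 4544021
gene_length_bp = 1251
protein_length_aa = 416


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 1251 bp corresponds to 417 codons, that is, 416 residues plus one stop codon.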


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0314915
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAACT TCAAACTGAA ATGTCTTAAA TGCGGAAGAG AGTACGGCCA GGAATACAGA 
CTCACCTGTG AGAATGATAA CGCCTTTTTA CGGGCGGAGT ACTCAGAAAA AAGACTGGTG
TTAAGAAACC AGCCAGGTAT TGGAAGGTTT CATTCCTGGC TCCCGGTCCA GGAAGAACTC
ACTACGGATG CAGGACCCAT AACTTACAAA AGTGAAGCTT TTGCTAGAGA ACTCGGGCTC
TCTAACCTTT ATATAGGATT CAGCGGGTAC TGGCCTGAAA GAGGGGCTTT CATTAAAACC
TGCAGTTTTA AAGAACTTGA AGCCCACCCA ACCATGCAAC TCCTGAAAGA GACCGGAGGA
AAAGCCGTAG TCCTTGCCTC TGCAGGAAAT ACCGGTCGGG CCTTTGCGCA CGTATCGGCT
TTGACAGGAA CTGATGTCTA CATAGTTGTA CCGGAATCCG GAGCTTCGAA GCTGTGGCTG
CCTGAGGAGC CTACTGAATC TGTCCATCTC ATCAGTATGA GTCCAGGAAA CGATTACACT
GACGCTATCA ATCTCGCAGG CAGGATTGCA AAGCTGCCTG GTATGGTATC CGAAGGTGGA
GCAAGAAATA TCGCCAGAAG AGATGGAATG GGCACTGTGA TGCTGGATGC AGCAGTAACT
ATAGGAAAAA TGCCTGATCA CTACTTCCAG GCAGTAGGAA GCGGTACAGG AGGAATTTCA
GTATGGGAAG CTGCAATGCG TCTCAGAACC GATGGGCGGT TCGGGCAAAA ACTCCCGAAA
CTCCAGCTTG CCCAGAACCT TCCTTTTGTT CCCATGTACA ATGCCTGGCA GGAAAAAAGA
AGAGAAATTA TTCCCGAACT TGACATGAAG GATGCAAAGA AACAGGTAGA AGAAACCTAT
GCAACCGTGC TTACCAACCG TACTCCACCA TATGGGGTTA TGGGCGGGTT ATACGATGCA
CTTACCGACA CTGACGGAAT AATGTACGCA ATTACCAGAG AAGAAGCCCT TGAAGCTAAG
GCTCTTTTCG AATCTCTTGA AGGAATCGAC ATTCTCCCTC CATCAGCAGT TGCAACAGCC
TCCCTATTAA AAGCTGTGGA AGAAGGAAAT GTTAGTAAGG ATGAAACTAT TCTTCTGAAC
CTTGCAGGCG GAGGATATAA ACGCCTGAAA GAGGACTACA CACTTTATCA GATCGAGCCG
GTAGCTACTG CTAAAAATCC TGATATTTCC TTAGACGAGC TGAAGATCTA G
 
Protein sequence
MGNFKLKCLK CGREYGQEYR LTCENDNAFL RAEYSEKRLV LRNQPGIGRF HSWLPVQEEL 
TTDAGPITYK SEAFARELGL SNLYIGFSGY WPERGAFIKT CSFKELEAHP TMQLLKETGG
KAVVLASAGN TGRAFAHVSA LTGTDVYIVV PESGASKLWL PEEPTESVHL ISMSPGNDYT
DAINLAGRIA KLPGMVSEGG ARNIARRDGM GTVMLDAAVT IGKMPDHYFQ AVGSGTGGIS
VWEAAMRLRT DGRFGQKLPK LQLAQNLPFV PMYNAWQEKR REIIPELDMK DAKKQVEETY
ATVLTNRTPP YGVMGGLYDA LTDTDGIMYA ITREEALEAK ALFESLEGID ILPPSAVATA
SLLKAVEEGN VSKDETILLN LAGGGYKRLK EDYTLYQIEP VATAKNPDIS LDELKI
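
The GC content and protein sequence reported in this record can, in principle, be recomputed from the gene sequence. The following is a minimal sketch under stated assumptions, not the method the database itself uses: `clean`, `gc_content`, and `translate` are hypothetical helper names, and the codon table is the standard one, whose amino acid assignments are the same as those of translation table 11 (table 11 differs only in the set of permitted start codons). Only the first line of the gene sequence is embedded as sample input; the full sequence can be pasted in the same way.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments, in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, as formatted in this record.
first_line = "ATGGGAAACT TCAAACTGAA ATGTCTTAAA TGCGGAAGAG AGTACGGCCA GGAATACAGA"
print(translate(first_line))
print(round(gc_content(first_line), 1))
```

Running `translate` on the full gene sequence and comparing the result with the protein sequence above is a quick way to spot copy or formatting errors in either block.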