Gene Mbar_A3503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3503 
Symbol 
ID3624901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp4492772 
End bp4494070 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content47% 
IMG OID637702330 
Producttryptophan synthase subunit beta 
Protein accessionYP_306954 
Protein GI73670939 
COG category[R] General function prediction only 
COG ID[COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) 
TIGRFAM ID[TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.385411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.814215 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAGA CCAAGATTAT CCTTGACGAG AATGAAATGC CTAAACAGTG GTATAACGTC 
CTTTCGGACT TATCTTCTCC TATCGAGCCG CCTCTTGATC CAAGAACCCG GGAGCCTATA
AGCCCTGAAG CCCTTGAACC TATTTTTCCA AAAGGTCTTA TCAAGCAGGA AATGAGCAGT
GAAAGGTTTA TTGATATCCC TGAAGAGATA CTTGAAATTT ACAGACTCTG GAGACCCAGT
CCTCTTTACA GGGCTCACAG GCTGGAAAAA CTTCTCAAAA CCCCTGCAAA GATATACTAT
AAAAATGAAG GGGTAAGCCC CGCAGGCAGC CATAAGACAA ATACCTCAAT TGCCCAGGCC
TACTATAATA TGAAAGAAGG AACCGAGAGG ATCACAACCG AAACAGGGGC AGGGCAGTGG
GGAAGTGCAT TGGCTCTTGC CTGCAATTAT TTTGATCTCG AATGTAAAGT CTATATGGTC
CGTTCCAGTT TTTACCAGAA ACCTTACCGG AAATCCATGA TAACAATCTG GGGAGGAAAT
GTTGTGCCCT CGCCAAGTGA AGATACCGAA TTCGGGAGAA AGATTCTGAA GGAGCAGCCG
GAAACACCGG GAAGCCTGGG AATAGCAATC AGTGAAGCCG TAGAGGATGC AATCGCACAC
GATAGCACTA AATATTCCCT TGGAAGCGTG CTCAACCATG TTATGCTCCA CCAGACGATA
ATCGGGGCCG AATGCAAGAA GCAGCTTGAA CAAGTTGAAA CCTATCCTGA TATAGTTATA
GGCTGCTGCG GAGGAGGAAG CAACCTCGCG GGAATATCTC TTGAGTTTAT TAAAGACAAG
ATTGAAGGAA AGAGGAATCC AAGAGTAATT GCAGTCGAGC CCTCAGCCTG CCCGTCCCTG
ACCAAAGGAG AATACAGGTA CGACTTCGGA GACACTGCGG AAATGACCCC ACTTCTTAAA
ATGTATACCC TGGGACACAA ACATGTGCCT CCTGCCATCC ACGCAGGCGG ACTCCGTTAC
CATGGTGACT CCCCAATCAT AAGTAAACTC TGTGATGAAG GGTTTATCGA AGCCGTTTCA
TATGATCAGT ATCCTGTATT TGATGCCGCT GTGCAGTTCG CACGGACTGA AGGCATAGTC
CCTGCTCCGG AATCCGCGCA TGCTATCCGC TGTGCAATCG ACGAAGCTAT CAAATGCAAG
CAAACCGGAG AAGAAAAAAC AATTCTCTTC AACCTGAGTG GACACGGACA TTTCGACATG
AGTTCATATG ACAAATATTT CAACAAAGAA CTTGCCTGA
 
Protein sequence
MEQTKIILDE NEMPKQWYNV LSDLSSPIEP PLDPRTREPI SPEALEPIFP KGLIKQEMSS 
ERFIDIPEEI LEIYRLWRPS PLYRAHRLEK LLKTPAKIYY KNEGVSPAGS HKTNTSIAQA
YYNMKEGTER ITTETGAGQW GSALALACNY FDLECKVYMV RSSFYQKPYR KSMITIWGGN
VVPSPSEDTE FGRKILKEQP ETPGSLGIAI SEAVEDAIAH DSTKYSLGSV LNHVMLHQTI
IGAECKKQLE QVETYPDIVI GCCGGGSNLA GISLEFIKDK IEGKRNPRVI AVEPSACPSL
TKGEYRYDFG DTAEMTPLLK MYTLGHKHVP PAIHAGGLRY HGDSPIISKL CDEGFIEAVS
YDQYPVFDAA VQFARTEGIV PAPESAHAIR CAIDEAIKCK QTGEEKTILF NLSGHGHFDM
SSYDKYFNKE LA