Gene Mbar_A2069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A2069 
Symbol 
ID3626406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp2613069 
End bp2614730 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content48% 
IMG OID637700944 
Productdihydroxy-acid dehydratase 
Protein accessionYP_305580 
Protein GI73669565 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.297693 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.926907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAGCG ATATCATCAA AGAGGGACCT GAACGTGCTC CCAACCGTTC ACTTCTGAAA 
GCGACAGGAG TCACAGATTC CGAGATGAGG AAACCATTCA TCGCAGTCGT TAATTCCTGG
AACGATATAA TTCCTGGCCA TATTCACCTG AATAAACTTG CTGAGGCTGT AAAAGCCGGA
ATCAGGAACG CCGGAGGAGT TCCTTTTGAA TTCCATACTA TAGGAGTCTG CGATGGAATT
GCAATGGGAC ATGAAGGTAT GAAATATTCC CTTCCCAGCA GAGAAGTTAT TGAGGACACG
ATAGAACTTA TGGTTAAAGC CCATCAGTTT GATGGGATTG TCCTTATTCC CACATGTGAC
AAGATCGTGC CGGGTCACCT GATGGCAGCA GGCAGACTTG ATATCCCTGC AATTGTCGTA
ACCGGAGGGC CTATGCTTCC TGGTTATGTC GATGACAAAT ATACAGACCT GATCTCGGTT
TTCGAAGGAG TGGGTGCTTA CAGCACAGGC AAACTCTCAG AACGTGATCT TAAAAGGCTT
GAAAATCTTT CCTGTGGAGG AGCCGGATCC TGTGCAGGGA TGTTTACCGC AAACACTATG
GCCTGTGTAA CCGAAGCACT TGGTCTGAGC CTTCCAGGCT GTGCAACTGC TCACGCAGTG
GATGCAAAAA AGGCCCGCAT TGCCAAGGAA TCCGGTGAGC GTATCGTCGA AATGGTAAAA
GAGAACCTGA CTCCCAGAAA GATTGTCACT TTTAAGTCTT TTGAAAACGC CATAATGGTA
GACATGGCAG TCGGAGGGAG CACAAATACC ACTCTGCACC TTCCTGCCCT TGCCCACGAA
TTCGGGCTGA ATCTTCCTCT TGAAAAATTT GATGAATTGA GCAGAGAAAC CCCTCACCTG
ATATCGCTTC GCCCTGGCGG CCCTAATTTT ATGCTACACT TTGACAGGGC AGGCGGGGTT
CAGGCAGTCA TGCAGAGGCT TTCCTCCAAA CTTCACCTGG ATCAGCTTAC AGTAAATGGC
AAGACTATCG GGGAAAACTT GAACGAGTTT GAGATTATTA ACCCGAAGCT CAATAAAGAG
ATAATTGCAA CTCTTGAAAA GCCTATACAT GCCGAAGGAG GAATTGCAGT CCTTAAAGGC
AACCTTGCCC CTAACGGTTC GGTCGTAAAA CAGGCAGCTG TTGACCCTAA AATGCGCGTT
CATACAGGCC CTGCCAAAGT TTATGACTGT GAAGAAGATG CTATGGAAAA TATCCTTGCA
GGCAATGTAA AACCTGGAGA TATTGTAGTT ATCCGCTACG AAGGCCCGAA AGGAGGGCCT
GGAATGAGGG AAATGCTTGC GGCTACAGCC GCAATAGGAG GTATGGGCCT GCTGGAGTCA
GTTGCTCTGG TAACTGACGG ACGCTTTTCA GGAGGCACAC GTGGGCCATG TATAGGACAT
ATCTCTCCCG AAGCAAGCGA AGGAGGGCCA ATTGCCCTGG TAAAAGATGG GGATATGATA
GAGATTAATA TTCCTGAAAG AGTTCTGAAC CTTAAAGTCT CAGAAGAAGA GCTTGAGCAA
AGAAAGGCGG CATTCGTACC TCCAAAAAAG GAAGTTACAG GCTACCTTGC AAGGTACCAG
CGTTCCGTTC ACTCTGCAAA TACAGGCGGA ATAGTGGACT GA
 
Protein sequence
MRSDIIKEGP ERAPNRSLLK ATGVTDSEMR KPFIAVVNSW NDIIPGHIHL NKLAEAVKAG 
IRNAGGVPFE FHTIGVCDGI AMGHEGMKYS LPSREVIEDT IELMVKAHQF DGIVLIPTCD
KIVPGHLMAA GRLDIPAIVV TGGPMLPGYV DDKYTDLISV FEGVGAYSTG KLSERDLKRL
ENLSCGGAGS CAGMFTANTM ACVTEALGLS LPGCATAHAV DAKKARIAKE SGERIVEMVK
ENLTPRKIVT FKSFENAIMV DMAVGGSTNT TLHLPALAHE FGLNLPLEKF DELSRETPHL
ISLRPGGPNF MLHFDRAGGV QAVMQRLSSK LHLDQLTVNG KTIGENLNEF EIINPKLNKE
IIATLEKPIH AEGGIAVLKG NLAPNGSVVK QAAVDPKMRV HTGPAKVYDC EEDAMENILA
GNVKPGDIVV IRYEGPKGGP GMREMLAATA AIGGMGLLES VALVTDGRFS GGTRGPCIGH
ISPEASEGGP IALVKDGDMI EINIPERVLN LKVSEEELEQ RKAAFVPPKK EVTGYLARYQ
RSVHSANTGG IVD