Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2218 |
Symbol | |
ID | 3830825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2314109 |
End bp | 2314981 |
Gene Length | 873 bp |
Protein Length | 290 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637830140 |
Product | glycine cleavage H-protein |
Protein accession | YP_431050 |
Protein GI | 83591041 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0509] Glycine cleavage system H protein (lipoate-binding) |
TIGRFAM ID | [TIGR00527] glycine cleavage system H protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000000131988 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000190613 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACTGCAA AGTATGTTAT CTTGCCCTGT AACGGCCTGG ATAAAGAAGC CGGCTGCCTG GCCCGGGAAC TGGCCCTGAA AATGGCGGCG GCCACCGGGA GCGAGATTAT CTGCCCGGTG CTTTACCAGA CGGCGCCGTC CCGTTATGCT TCTCTTCTCC GGGAAGGTAG CCTGGTGGTT ATCGACGGCT GCGCCACCCG TTGCGCCAGC CGGATTGCCG CCAACAATAA CCTGAAGATT TACCGCAAGA TAACTATGAC CGAAGAAGCT AAAAAGAGGG ATTATAACCC CGGCCCGGAC TGGCGGGTGG GGGCAGGGGC GGCGGCCTTC GTGGAGGATG TCTGGCGGTC CTGGCAGCCT GTACTCAGGG AACAGGAAGC CGGCCGGGCG CCGGGAGAAA ACGCCGGCCT TTTTGCCGGC CCGCTGGAAT ACGCCATCCA TCGCCACGAT AAGTTTATCT TCCGGGTACC CCTGGAAGGC TTTTACTTCA ACGAGAACGA CTGCTGGGTG CAAGTAGAGG GGAACAGGGG CCGCGTAGGC ATCAGCGACT ACCTGCAGCA AAACCTTTCC GATATTACCT TTGTCACGCC CCCCGATCCG GGTACGGAAG TGGAGCAATT CGGCGAAATG GGTACCATCG AGTCGGCCAA AGCGGTTTAT GAACTGGTTT CCCCGGTTAC AGGCAGGGTG GTGGCCGTTA ATGAAGCCAT CCTAGAGGCG CCGGAACTCA TCAATGAAAA CCCCTATGAA AAGGGCTGGA TTACCGAACT GGAACTGACC AATTTTGAGG CTGACCGCGA ATTTTTACTG GACGGGCGGC GCTATATGGA AGTATTAAAG GAAAAGGTGG CTGATTTCAA TGCCGGTAAA TAG
|
Protein sequence | MTAKYVILPC NGLDKEAGCL ARELALKMAA ATGSEIICPV LYQTAPSRYA SLLREGSLVV IDGCATRCAS RIAANNNLKI YRKITMTEEA KKRDYNPGPD WRVGAGAAAF VEDVWRSWQP VLREQEAGRA PGENAGLFAG PLEYAIHRHD KFIFRVPLEG FYFNENDCWV QVEGNRGRVG ISDYLQQNLS DITFVTPPDP GTEVEQFGEM GTIESAKAVY ELVSPVTGRV VAVNEAILEA PELINENPYE KGWITELELT NFEADREFLL DGRRYMEVLK EKVADFNAGK
|
| |