Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2043 |
Symbol | |
ID | 3831189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2133173 |
End bp | 2134438 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637829972 |
Product | phosphoribosylamine--glycine ligase |
Protein accession | YP_430882 |
Protein GI | 83590873 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0151] Phosphoribosylamine-glycine ligase |
TIGRFAM ID | [TIGR00877] phosphoribosylamine--glycine ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.453314 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATCC TGGTAGTTGG CGGCGGCGGC CGGGAACACG CCCTGGTCTG GAAGATTGCC CGGAGTCCCA GGGTTACGGA AATCTACTGC GCCCCGGGCA ACGCCGGCAT CGCCGGCCTG GCCTACTGTG TGCCCATTGC TGCCGATGAT ATTACCGGCC TGCTGGACTT CGCCCGTCGG GCTGCCATAG ATCTCACCGT CGTCGGTCCC GAGGCCCCCC TGGTGGCCGG TATCGTCGAC GCCTTCCAGG CGGCCGGCCT GCCAATCTTT GGCCCCTCCC GGGGAGCGGC CCGGCTGGAA GGAAGCAAGA TTTTCGCCAA AGAGTTGATG CAGGAAGCAG GCATCCCCAC GGCTGCCCAC GCTACCTTTA CGGAGCCCGG TCCGGCCCTG GCCTACCTGG AGGAACACCC CGGACCGGTG GTGGTCAAGG CCGACGGCCT GGCAGCCGGC AAGGGCGTGG TGGTGGCCCC GGACGCGGCT ACGGCCCGAG CAGCGGTCCG GGATATGTTT GGAGGCAAAT TTGGTCCCGC CGGCCGGAGG GTAATCCTCG AAGAACGCCT GGAAGGGGAA GAAGTCAGCA TCCTGGCCCT GTGCGATGGG AAAACAGCTA TTCCCCTCCT CCCTTCCCAG GACCATAAGC GGGTGGGGGA AGGGGATACC GGCCCCAATA CCGGCGGCAT GGGCGCCTAC GCGCCGGTGC CCTTCTATAC CCCGGAGATT GCCGCCCAGG TGGAAGAAAG GGTTCTCCGG CCGGTTATCC GGACCATGGC CGCAGCCGGG CACCCCTACC GGGGCGTCCT TTACGCCGGG CTGATGCTTA CCCGGGAAGG TCCCAAAGTC CTGGAGTTCA ACTGTCGCTT CGGCGACCCG GAAACCCAGC CCCTGATGCT CCTTTTGGCA AGCGACCTGG TGGAGCTGAT GCTAGCTGCC GTCAACGGTG AGCTGGCGGG AACGAGGATC GCCTGGTACC CCGGTGCCGC CGCCGGGGTG GTCCTGGCGG CGGGCGGTTA TCCGGGCCCG TACGCCAAGG GTGATTTTAT TACGGGATTG GAGGCAGTGG CACCGGGGGT GGAAGTCTTC CACGCCGGCA CCGCCCTGGT GGACGGCCAG GTCGTAACGT CCGGGGGCCG GGTGGTGTGT GTAACCGCCC GGGGTAAAGA TTTACGGGCC GCTTTGGACC GCGTTTATGA CAGTATCAGG GCCATCCACT TCCAGGGAAT GCACTACCGC CGGGATATCG GCCGTCGGGC CTTGGAGGCA ACTTAA
|
Protein sequence | MRILVVGGGG REHALVWKIA RSPRVTEIYC APGNAGIAGL AYCVPIAADD ITGLLDFARR AAIDLTVVGP EAPLVAGIVD AFQAAGLPIF GPSRGAARLE GSKIFAKELM QEAGIPTAAH ATFTEPGPAL AYLEEHPGPV VVKADGLAAG KGVVVAPDAA TARAAVRDMF GGKFGPAGRR VILEERLEGE EVSILALCDG KTAIPLLPSQ DHKRVGEGDT GPNTGGMGAY APVPFYTPEI AAQVEERVLR PVIRTMAAAG HPYRGVLYAG LMLTREGPKV LEFNCRFGDP ETQPLMLLLA SDLVELMLAA VNGELAGTRI AWYPGAAAGV VLAAGGYPGP YAKGDFITGL EAVAPGVEVF HAGTALVDGQ VVTSGGRVVC VTARGKDLRA ALDRVYDSIR AIHFQGMHYR RDIGRRALEA T
|
| |