Gene Information |
Locus tag | Acel_1222 |
Symbol | |
ID | 4486163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 1361104 |
End bp | 1363998 |
Gene Length | 2895 bp |
Protein Length | 964 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639729998 |
Product | glycine dehydrogenase |
Protein accession | YP_872980 |
Protein GI | 117928429 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
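The length fields in the table above follow from the coordinates. A minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS length includes the terminal stop codon (both assumptions are consistent with the listed values, but the record does not state them). The function names are illustrative only.

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
START_BP = 1361104
END_BP = 1363998


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive coordinate interval."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (2895 bp) and Protein Length (964 aa) fields.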
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.117905 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACTCG CACCGCACGG CCAGCCGGTT TCTGACGCGT TCACAGCTGG TGCAGCCGAC GGTTTCCTGC GCCGGCACAT CGGTCCGGAC GACGCCGAGA TTACCCGCAT GCTCTCGGTA GTCGGCTACC CAAGCCTGGA CGCCCTGATG GATGCGGCGC TGCCGCCGGC GATCCGTGAC CCTGTGGACC GGCCCAGCCT GTTGCCACCC CCGGTGGACG AAGCAGCGGT TACGGCCGCA CTGCGGGAGA TTGCCGGCAT GAACCGGCCG CTCACCTCGA TGATCGGACT TGGCTACTAC CGGTCGCACA CGCCCGCGGT GATCCGGCGG AACGTGCTGG AGAATCCGGC GTGGTACACC GCGTATACCC CGTACCAGCC GGAAATTTCC CAAGGCCGCC TGGAGGCGTT GCTCGTCTTT CAGACGATGA TCGAGGACCT CACCGGGTTG GACGTTGCGG GGGCTTCCCT TCTCGACGAG CCGACGGCAG CCGCCGAGGC CGTGGCGTTA TGCCGTCGGA TGTCGACGTC CGCCAGCCGG CGGGTCGTCG TCGACCGTGA CGTTTTTCCG CAAACACGGG CCGTTCTGCA GACACGGGCG AAGCCGATGC AGTGGGAGGT CGTCGTCGCA GACCTTGGCG CAGGTTTGCC GGACGGCGAC TTTTTCGCGG TGCTGGTGCA GAATCCCGGG ACGAGCGGCA GGGTCCGCGA CTATCGCGCG CTGACGGCGG AGGCACACGC GCGCGGCGCG TTCGTCATCG CGGCCGTCGA TGTGCTGTCC CTTGCCCTGC TGCCGCCGCC GGGGGAGTGG GGTGCGGACG TCGCCGTCGG TTCAGCCCAG CGTTTCGGTG TTCCGCTCTG GTACGGCGGG CCGCACGCCG GATTTCTTGC GGCACGCGCC GACTTCACCC GTTCATTGCC CGGACGGCTG GTCGGCGTCT CGGTGGACGG CGACGGCCGG CCGGCGTACC GGTTGGCGTT GCAAACCCGC GAGCAGCACA TCCGGCGGGA GAAGGCGACC AGCAACATCT GCACGGCGCA GGTCCTGCTC GCGGTGGTCG CGGCCATGTA CGCCGTGTAC CACGGACCGG ATGGGCTGCA ACGCATCGCC CGACGCGTGC ACGACACGGC GCGGACGCTG GCCGCGCTGC TTCGCGGCGC CGATTTCGCG GTCACCGATG ACTTCTTCGA CACGGTGGAG GTGAGCGTGC CGGGGCAGGC CGACCGGTAC GTCGCACGAG CCCTCGACGA GGGCATCAAT ATCCGACGCG TCGACGCCGA CACGGTCGCC GTTTCCTGCG ACGAGACAAC GACTCTCGAT GACCTCCGTC GGCTGGCTGC GGCGTTCGGC ATCGCAACGG ATGTCAACCA CCTCACTGAG CTCAGCCATC AGCTGCCGGC GTCACCGTTG CCGCGGCGGG ACTCGGAGTT TCTCACGCAT CCCACCTTCC ACCGGTACCG GTCCGAGACG GCGATGATGC GGTACCTCCG TCGGCTGGCT GACAAGGACA TTGCCTTGGA TCGGTCGATG ATTCCGCTCG GCTCCTGCAC GATGAAACTC AACGCGGCCG TCGAATTGGA GGCGTTGAGT TGGCCGGAGT TCGCCGACAT TCACCCGTTC GTCCCGGCCG ACCAGGCGGC GGGGTATCAC CGCATCGTCG CGGATTTGCA ACGCTGGCTC GCCGACCTGA CCGGTTATGA CGCGGTGAGT CTGCAACCGA ATGCCGGTTC CCAGGGAGAA TTCGCCGGCT TGCTGGCCAT TCGCGCGTAT CACCAAGCTC GGGGTGAGGG GCATCGGGAC 
GTCTGCCTGA TTCCGGCGTC GGCTCACGGC ACGAATGCGG CGAGCGCGGC AATGGCGGGT TTCCGGGTGG TTGTCGTCCG GTGCGACGCG GACGGGAACG TCGATCTCGA CGATTTGGAG GCGAAATTGG CTGCACACCA GGGTCAGGTG GCGGCCATCA TGCTGACGTA TCCGTCGACA CACGGAGTTT TTGAGGAAGC CGTGACCGAC ATTTGCCAGC GAGTGCACGA GGCCGGGGGA CAGGTGTACC TCGACGGGGC GAATCTCAAC GCGTTGCTCG GCTATGCCCG GTTCGGCGCG TTCGGCGCCG ACGTTTCACA CGTGAATTTG CACAAGACCT TCTGCATCCC GCACGGCGGG GGCGGACCGG GTGTGGGCCC GATCGGGGTG CGGGCGCACC TTGCGCCGTA CCTGCCGAAC CATCCGCTGG ATCCCGCGGC CGGACCCGCC ACCGGCCCCG GGCCGGTGGC GGGCGCGCCG TACGGCTCAC CGGGCGTGCT GCCAATTTCT TGGGCATACC TGCGGCTGAT GGGCATTGAC GGGTTGCGGC GCGCGACGGA CGTCGCGGTG CTGGCCGCCA ATTACGTCGC CCGGCGGCTT GCCGATGCGT TTCCCGTGCT CTACACCGGG CGGAATGGTC TGGTCGCCCA CGAGTGCATT CTCGATCTGC GCGACATCAC GCGTCGCACC GGGATCACCG TCGAGGATGT GGCGAAGCGA TTGATGGATT ACGGTTTTCA TGCGCCGACC ATGTCGTTTC CGGTGGCCGG TACGCTCATG GTCGAGCCGA CGGAATCGGA GAATCTCGCG GAATTGGACC GTTTCGTCGC CGCTATGCGG GCGATCCGGG CGGAAATCGC CCGGGTGGAG CGGGGCGAGT GGCCGGCGGA CGACAATCCG CTGCGCAATG CACCGCACAC CGCATTGGCG CTTGCCGGCG AGTGGCGGCA TCCGTATTCA CGGGAGGAGG CGTTCTTCCC GCTGCCGGAA ATCCGGGAGA ACAAGTATTT TCCGCCGGTG GCCCGCATTG ACGGCGCGTA CGGCGACCGA CATCTCGTCT GCGAGTGCCC GCCGCTGAGC GCCTACGAAG ACTGA
|
Protein sequence | MTLAPHGQPV SDAFTAGAAD GFLRRHIGPD DAEITRMLSV VGYPSLDALM DAALPPAIRD PVDRPSLLPP PVDEAAVTAA LREIAGMNRP LTSMIGLGYY RSHTPAVIRR NVLENPAWYT AYTPYQPEIS QGRLEALLVF QTMIEDLTGL DVAGASLLDE PTAAAEAVAL CRRMSTSASR RVVVDRDVFP QTRAVLQTRA KPMQWEVVVA DLGAGLPDGD FFAVLVQNPG TSGRVRDYRA LTAEAHARGA FVIAAVDVLS LALLPPPGEW GADVAVGSAQ RFGVPLWYGG PHAGFLAARA DFTRSLPGRL VGVSVDGDGR PAYRLALQTR EQHIRREKAT SNICTAQVLL AVVAAMYAVY HGPDGLQRIA RRVHDTARTL AALLRGADFA VTDDFFDTVE VSVPGQADRY VARALDEGIN IRRVDADTVA VSCDETTTLD DLRRLAAAFG IATDVNHLTE LSHQLPASPL PRRDSEFLTH PTFHRYRSET AMMRYLRRLA DKDIALDRSM IPLGSCTMKL NAAVELEALS WPEFADIHPF VPADQAAGYH RIVADLQRWL ADLTGYDAVS LQPNAGSQGE FAGLLAIRAY HQARGEGHRD VCLIPASAHG TNAASAAMAG FRVVVVRCDA DGNVDLDDLE AKLAAHQGQV AAIMLTYPST HGVFEEAVTD ICQRVHEAGG QVYLDGANLN ALLGYARFGA FGADVSHVNL HKTFCIPHGG GGPGVGPIGV RAHLAPYLPN HPLDPAAGPA TGPGPVAGAP YGSPGVLPIS WAYLRLMGID GLRRATDVAV LAANYVARRL ADAFPVLYTG RNGLVAHECI LDLRDITRRT GITVEDVAKR LMDYGFHAPT MSFPVAGTLM VEPTESENLA ELDRFVAAMR AIRAEIARVE RGEWPADDNP LRNAPHTALA LAGEWRHPYS REEAFFPLPE IRENKYFPPV ARIDGAYGDR HLVCECPPLS AYED
|
| |
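The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a string could be cleaned, checked for GC content, and translated, assuming the amino acid assignments of the standard code (which translation table 11 shares; the tables differ in permitted start codons). The listed gene sequence begins with ATG and ends with TGA, so it appears to be given in coding orientation already even though the strand is "-"; no reverse complement is applied here. All names are illustrative, and only the first 30 bases of the record are used as sample input.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments of the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
sample = "ATGACACTCG CACCGCACGG CCAGCCGGTT"
print(translate(sample), round(gc_content(sample), 3))
```

Running the same functions on the full gene sequence would allow the GC content and protein sequence fields to be cross-checked against the record.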