Gene Acel_1222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1222 
Symbol 
ID4486163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1361104 
End bp1363998 
Gene Length2895 bp 
Protein Length964 aa 
Translation table11 
GC content67% 
IMG OID639729998 
Productglycine dehydrogenase 
Protein accessionYP_872980 
Protein GI117928429 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.117905 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTCG CACCGCACGG CCAGCCGGTT TCTGACGCGT TCACAGCTGG TGCAGCCGAC 
GGTTTCCTGC GCCGGCACAT CGGTCCGGAC GACGCCGAGA TTACCCGCAT GCTCTCGGTA
GTCGGCTACC CAAGCCTGGA CGCCCTGATG GATGCGGCGC TGCCGCCGGC GATCCGTGAC
CCTGTGGACC GGCCCAGCCT GTTGCCACCC CCGGTGGACG AAGCAGCGGT TACGGCCGCA
CTGCGGGAGA TTGCCGGCAT GAACCGGCCG CTCACCTCGA TGATCGGACT TGGCTACTAC
CGGTCGCACA CGCCCGCGGT GATCCGGCGG AACGTGCTGG AGAATCCGGC GTGGTACACC
GCGTATACCC CGTACCAGCC GGAAATTTCC CAAGGCCGCC TGGAGGCGTT GCTCGTCTTT
CAGACGATGA TCGAGGACCT CACCGGGTTG GACGTTGCGG GGGCTTCCCT TCTCGACGAG
CCGACGGCAG CCGCCGAGGC CGTGGCGTTA TGCCGTCGGA TGTCGACGTC CGCCAGCCGG
CGGGTCGTCG TCGACCGTGA CGTTTTTCCG CAAACACGGG CCGTTCTGCA GACACGGGCG
AAGCCGATGC AGTGGGAGGT CGTCGTCGCA GACCTTGGCG CAGGTTTGCC GGACGGCGAC
TTTTTCGCGG TGCTGGTGCA GAATCCCGGG ACGAGCGGCA GGGTCCGCGA CTATCGCGCG
CTGACGGCGG AGGCACACGC GCGCGGCGCG TTCGTCATCG CGGCCGTCGA TGTGCTGTCC
CTTGCCCTGC TGCCGCCGCC GGGGGAGTGG GGTGCGGACG TCGCCGTCGG TTCAGCCCAG
CGTTTCGGTG TTCCGCTCTG GTACGGCGGG CCGCACGCCG GATTTCTTGC GGCACGCGCC
GACTTCACCC GTTCATTGCC CGGACGGCTG GTCGGCGTCT CGGTGGACGG CGACGGCCGG
CCGGCGTACC GGTTGGCGTT GCAAACCCGC GAGCAGCACA TCCGGCGGGA GAAGGCGACC
AGCAACATCT GCACGGCGCA GGTCCTGCTC GCGGTGGTCG CGGCCATGTA CGCCGTGTAC
CACGGACCGG ATGGGCTGCA ACGCATCGCC CGACGCGTGC ACGACACGGC GCGGACGCTG
GCCGCGCTGC TTCGCGGCGC CGATTTCGCG GTCACCGATG ACTTCTTCGA CACGGTGGAG
GTGAGCGTGC CGGGGCAGGC CGACCGGTAC GTCGCACGAG CCCTCGACGA GGGCATCAAT
ATCCGACGCG TCGACGCCGA CACGGTCGCC GTTTCCTGCG ACGAGACAAC GACTCTCGAT
GACCTCCGTC GGCTGGCTGC GGCGTTCGGC ATCGCAACGG ATGTCAACCA CCTCACTGAG
CTCAGCCATC AGCTGCCGGC GTCACCGTTG CCGCGGCGGG ACTCGGAGTT TCTCACGCAT
CCCACCTTCC ACCGGTACCG GTCCGAGACG GCGATGATGC GGTACCTCCG TCGGCTGGCT
GACAAGGACA TTGCCTTGGA TCGGTCGATG ATTCCGCTCG GCTCCTGCAC GATGAAACTC
AACGCGGCCG TCGAATTGGA GGCGTTGAGT TGGCCGGAGT TCGCCGACAT TCACCCGTTC
GTCCCGGCCG ACCAGGCGGC GGGGTATCAC CGCATCGTCG CGGATTTGCA ACGCTGGCTC
GCCGACCTGA CCGGTTATGA CGCGGTGAGT CTGCAACCGA ATGCCGGTTC CCAGGGAGAA
TTCGCCGGCT TGCTGGCCAT TCGCGCGTAT CACCAAGCTC GGGGTGAGGG GCATCGGGAC
GTCTGCCTGA TTCCGGCGTC GGCTCACGGC ACGAATGCGG CGAGCGCGGC AATGGCGGGT
TTCCGGGTGG TTGTCGTCCG GTGCGACGCG GACGGGAACG TCGATCTCGA CGATTTGGAG
GCGAAATTGG CTGCACACCA GGGTCAGGTG GCGGCCATCA TGCTGACGTA TCCGTCGACA
CACGGAGTTT TTGAGGAAGC CGTGACCGAC ATTTGCCAGC GAGTGCACGA GGCCGGGGGA
CAGGTGTACC TCGACGGGGC GAATCTCAAC GCGTTGCTCG GCTATGCCCG GTTCGGCGCG
TTCGGCGCCG ACGTTTCACA CGTGAATTTG CACAAGACCT TCTGCATCCC GCACGGCGGG
GGCGGACCGG GTGTGGGCCC GATCGGGGTG CGGGCGCACC TTGCGCCGTA CCTGCCGAAC
CATCCGCTGG ATCCCGCGGC CGGACCCGCC ACCGGCCCCG GGCCGGTGGC GGGCGCGCCG
TACGGCTCAC CGGGCGTGCT GCCAATTTCT TGGGCATACC TGCGGCTGAT GGGCATTGAC
GGGTTGCGGC GCGCGACGGA CGTCGCGGTG CTGGCCGCCA ATTACGTCGC CCGGCGGCTT
GCCGATGCGT TTCCCGTGCT CTACACCGGG CGGAATGGTC TGGTCGCCCA CGAGTGCATT
CTCGATCTGC GCGACATCAC GCGTCGCACC GGGATCACCG TCGAGGATGT GGCGAAGCGA
TTGATGGATT ACGGTTTTCA TGCGCCGACC ATGTCGTTTC CGGTGGCCGG TACGCTCATG
GTCGAGCCGA CGGAATCGGA GAATCTCGCG GAATTGGACC GTTTCGTCGC CGCTATGCGG
GCGATCCGGG CGGAAATCGC CCGGGTGGAG CGGGGCGAGT GGCCGGCGGA CGACAATCCG
CTGCGCAATG CACCGCACAC CGCATTGGCG CTTGCCGGCG AGTGGCGGCA TCCGTATTCA
CGGGAGGAGG CGTTCTTCCC GCTGCCGGAA ATCCGGGAGA ACAAGTATTT TCCGCCGGTG
GCCCGCATTG ACGGCGCGTA CGGCGACCGA CATCTCGTCT GCGAGTGCCC GCCGCTGAGC
GCCTACGAAG ACTGA
 
Protein sequence
MTLAPHGQPV SDAFTAGAAD GFLRRHIGPD DAEITRMLSV VGYPSLDALM DAALPPAIRD 
PVDRPSLLPP PVDEAAVTAA LREIAGMNRP LTSMIGLGYY RSHTPAVIRR NVLENPAWYT
AYTPYQPEIS QGRLEALLVF QTMIEDLTGL DVAGASLLDE PTAAAEAVAL CRRMSTSASR
RVVVDRDVFP QTRAVLQTRA KPMQWEVVVA DLGAGLPDGD FFAVLVQNPG TSGRVRDYRA
LTAEAHARGA FVIAAVDVLS LALLPPPGEW GADVAVGSAQ RFGVPLWYGG PHAGFLAARA
DFTRSLPGRL VGVSVDGDGR PAYRLALQTR EQHIRREKAT SNICTAQVLL AVVAAMYAVY
HGPDGLQRIA RRVHDTARTL AALLRGADFA VTDDFFDTVE VSVPGQADRY VARALDEGIN
IRRVDADTVA VSCDETTTLD DLRRLAAAFG IATDVNHLTE LSHQLPASPL PRRDSEFLTH
PTFHRYRSET AMMRYLRRLA DKDIALDRSM IPLGSCTMKL NAAVELEALS WPEFADIHPF
VPADQAAGYH RIVADLQRWL ADLTGYDAVS LQPNAGSQGE FAGLLAIRAY HQARGEGHRD
VCLIPASAHG TNAASAAMAG FRVVVVRCDA DGNVDLDDLE AKLAAHQGQV AAIMLTYPST
HGVFEEAVTD ICQRVHEAGG QVYLDGANLN ALLGYARFGA FGADVSHVNL HKTFCIPHGG
GGPGVGPIGV RAHLAPYLPN HPLDPAAGPA TGPGPVAGAP YGSPGVLPIS WAYLRLMGID
GLRRATDVAV LAANYVARRL ADAFPVLYTG RNGLVAHECI LDLRDITRRT GITVEDVAKR
LMDYGFHAPT MSFPVAGTLM VEPTESENLA ELDRFVAAMR AIRAEIARVE RGEWPADDNP
LRNAPHTALA LAGEWRHPYS REEAFFPLPE IRENKYFPPV ARIDGAYGDR HLVCECPPLS
AYED