Gene Dgeo_2730 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2730 
Symbol 
ID4073961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008010 
Strand
Start bp296721 
End bp298220 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content69% 
IMG OID641228746 
Producthistidine ammonia-lyase 
Protein accessionYP_594237 
Protein GI94972197 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCCTAG ACCGGCAATT GACGCTTGAC GACTTTATCC GCGTGGTGCG TGGCGGCGAG 
GAGGTGACCC TTGCTGATGC GGCGCGGACA CGGATGGGAC GAGCGCGGGC GGTGATCGAG
CGCATCGTCG ATGGCCCCGA AGCCGTGTAC GGCGTGAACA CGGGCTTTGG CAAGTTCGCC
TCGGTCCGCG TGGCGCGCGA GGAGCTGAAG CAGCTCCAGC ACAACCTGAT TGTGTCGCAT
GCAATCGGGG TAGGTGCAGG TTTGCCTGCC GAAGTCGTGC GCGGAATGCT GCTGCTGCGG
GCCCAGTCGC TCGCTCTTGG GCATTCGGGG GTGCGGCCGG AGGTGGTCGA ACTCCTGCTC
GCGCTGCTCA ACGCGGGAGC CTGCCCGGTC GTGCCTGCCC AAGGCAGCGT GGGGGCGAGC
GGCGACCTGG CCCCGCTCGC GCACCTCGCG TTGGCCCTGA TCGGAGGGGG CGAATTGGAA
TACGGCGGTC AGGTGCGGCC CGCTGCCGAC GTGCTGGCCG AACTGGGTCT CCAGCCGCTC
ACGCTGGAGG CAAAGGAGGG GCTGGCCCTC ATCAACGGCA CACAGCTGAT GGGCAGCCTG
CTGGCCCTCG CGCTGCACGA CGCCCGGACA CTGCTGCACA CGGCGAACCT GGCCGCTGCA
ATGACGGTGG AGGCGCTGTC CGGCAGTCAC CGGCCTTTCA GTGAGGGCGT GGTGAGCTTA
CGGCCCCACC CCGGCGCGCT GGAGGTCGCC GCCGACCTGC GCGCCTTCCT GCACGGTTCG
GACATCGCAC CGGCCCACGC CCACTGCGGC AAGGTGCAGG ATGCCTACAG TCTGCGGGCG
GTGCCCCAGG TCCACGGCGC TTCCCTCGAC GCACTAATGC AAGCGGGGCG CGTGCTGGAG
GTGGAATTTG CCAGCGTGAC CGACAATCCG CTGATCTTCC CCGAGACGGG CGAGGTGATC
TCGGGCGGCA ATTTCCACGG GCAGCCCCTT GCCCTGGCGG CCGATGCCCT GAAGGTGGCG
GTGGCCGAAC TGGCGAACAT CAGCGAACGC CGCAGCGAGC AACTGCTGAA TCCGGCCCTG
TCGGGGCTAC CGGGGTTCCT GACGCCGGAA GGGGGCTTAA GCAGCGGCTT CATGATCGCG
CAGTACACCG CCGCCGCCCT GGTCAGCGAG AACAAGGTGC TGGCCCACCC CGCCAGCGTG
GACTCGATTC CGACGAGCGC CAATCAGGAA GACCATGTCA GCATGGGCGC GCATGGAGCA
CGGCAGCTGC GGCAGATCCT GGAAAACGCG CAGAGCGTCA TCAGCATCGA GCTGCTGTGC
GCCGCGCAGG CCCTGGACTT CCAGTCGCTG CGCGCTGGGC GAGGCGTGCA GGCCGCCTAC
GAGCGCATCC GGCAGGAGGT CGCACCGCTC GGCCAGGACC GCTACTACCG GCCCGACCTC
CTGCGGGTGC GCGAGCTGGT GACCAGCGGC GAGCTGCTGC GGGCCGCCCG GGAGGCTTGA
 
Protein sequence
MILDRQLTLD DFIRVVRGGE EVTLADAART RMGRARAVIE RIVDGPEAVY GVNTGFGKFA 
SVRVAREELK QLQHNLIVSH AIGVGAGLPA EVVRGMLLLR AQSLALGHSG VRPEVVELLL
ALLNAGACPV VPAQGSVGAS GDLAPLAHLA LALIGGGELE YGGQVRPAAD VLAELGLQPL
TLEAKEGLAL INGTQLMGSL LALALHDART LLHTANLAAA MTVEALSGSH RPFSEGVVSL
RPHPGALEVA ADLRAFLHGS DIAPAHAHCG KVQDAYSLRA VPQVHGASLD ALMQAGRVLE
VEFASVTDNP LIFPETGEVI SGGNFHGQPL ALAADALKVA VAELANISER RSEQLLNPAL
SGLPGFLTPE GGLSSGFMIA QYTAAALVSE NKVLAHPASV DSIPTSANQE DHVSMGAHGA
RQLRQILENA QSVISIELLC AAQALDFQSL RAGRGVQAAY ERIRQEVAPL GQDRYYRPDL
LRVRELVTSG ELLRAAREA