Gene Dgeo_1257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_1257 
Symbol 
ID4058755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp1335183 
End bp1336352 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content64% 
IMG OID641230271 
Producthomocitrate synthase 
Protein accessionYP_604722 
Protein GI94985358 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR02146] homocitrate synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.943994 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.64456 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCAG ATTCCTCCAC GCCCCTGATC CCGGCCCGTT CGTGGGCCAT CATCGACTCG 
ACGCTGCGGG AGGGCGAGCA GTTCGCACGC GGGAACTTCA AAACGGGGGA CAAGATCGAG
ATCGCCCGGC TGCTGGACGC CTTTGGGGCC GAATTTCTGG AAGTGACCAC ACCGATGGTG
GGCGCGCAGA CGCAGGCCGA TATCCGGCGC CTGACCTCGC TCGGTCTGAA TGCCAAGATC
CTCACCCATG TGCGCTGCCA CCTGGAGGAC GTGCAGCGGG CGGTGGATTT GGGCGTGGAC
GGGCTGGACC TGCTGTTTGG CACCAGCTCC TTCCTGCGTG AATTCAGCCA CGGCAAGAGC
ATCGCGCAGA TCATCGACAC GGCCTCAGAG GTGATCGGCT GGATCAAGCA AAACCACCCC
GATCTCGAGA TCCGCTTCAG CGCAGAAGAC ACCTTCCGTT CGGAGGAGGC GGACCTGATG
GCGGTGTACC GCGCCGTCTC CGACCTGGGC GTTCACCGCG TCGGGCTGGC GGACACGGTG
GGGGTGGCGA CTCCCCGGCA GGTCTACACG CTGGTGCGCG AGGTCCGCAA AGTTATCCAC
GCTGAATGCG GCATCGAGTT CCACGGTCAC AACGACACCG GCTGCGCGGT CTCCAACGCC
TATGAGGCGA TTGAGGCGGG CGCCACCCAC ATTGACACGA CCATCTTGGG GATCGGGGAG
CGCAACGGCA TCACGCCGCT GGGGGGCTTC CTGGCGCGGA TGTTTACCTT CGACCCTCAG
GGCTTGATCG ACAAATACAA CCTCGAGCTG CTGCCCGAAC TCGACCGCCT GATCGCGCGG
CTGGTGGATC TGCCGATTCC CTGGAACAAC TACCTGACCG GCGAATTCGC CTACAACCAC
AAGGCGGGAA TGCACCTCAA GGCGATCTAC CTCAACCCCG GTGCCTATGA GGCGATTCCG
CCCAGCGTCT TCGGCGTGGG CCGCCGCATC CAGGCCGCGA GCAAAGTCAC AGGCAAACAT
GCCATCGCCC ACAAGGCCCG TGAGCTGGGA CTGCACTACG GCGAGGACGC CCTGCGCCGC
GTGACCGACC ACATCAAAGC GCTGGCTGAG GAGGGCGAGC TGGACGACGC GCATCTGGAG
CAAGTGCTGC GCGAGTGGGT GCGGGCATAG
 
Protein sequence
MTPDSSTPLI PARSWAIIDS TLREGEQFAR GNFKTGDKIE IARLLDAFGA EFLEVTTPMV 
GAQTQADIRR LTSLGLNAKI LTHVRCHLED VQRAVDLGVD GLDLLFGTSS FLREFSHGKS
IAQIIDTASE VIGWIKQNHP DLEIRFSAED TFRSEEADLM AVYRAVSDLG VHRVGLADTV
GVATPRQVYT LVREVRKVIH AECGIEFHGH NDTGCAVSNA YEAIEAGATH IDTTILGIGE
RNGITPLGGF LARMFTFDPQ GLIDKYNLEL LPELDRLIAR LVDLPIPWNN YLTGEFAYNH
KAGMHLKAIY LNPGAYEAIP PSVFGVGRRI QAASKVTGKH AIAHKARELG LHYGEDALRR
VTDHIKALAE EGELDDAHLE QVLREWVRA