Gene Dgeo_1201 details


Gene Information

Locus tag: Dgeo_1201
Symbol:
ID: 4057711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 1276719
End bp: 1277918
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 69%
IMG OID: 641230216
Product: threonyl/alanyl tRNA synthetase, SAD
Protein accession: YP_604667
Protein GI: 94985303
COG category: [R] General function prediction only
COG ID: [COG2872] Predicted metal-dependent hydrolases related to alanyl-tRNA synthetase HxxxH domain
TIGRFAM ID:
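
The reported lengths follow from the coordinates. A minimal sketch, assuming Start bp and End bp are 1-based inclusive positions (which the reported 1200 bp implies) and that the CDS ends in a single stop codon; the variable names are illustrative only:

```python
# Coordinates as listed in the record.
start_bp = 1276719
end_bp = 1277918

# Inclusive 1-based coordinates: both endpoints are counted.
gene_length = end_bp - start_bp + 1

# Three bases per codon; the final codon is the stop and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1200 399
```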


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.703964
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGGC CGCTCTATGA CACGTCGCCC ACCCGCCTGA CCTTTACGGC TACCGTCACC 
GACCGGCGTG ACGGGGCCGT CGCGCTGGAT GCCAGCGCCT TCTACCCAGA GGGCGGCGGC
CAGAACGGTG ACGTCGGTGT CTTGCGCTGG CGCGGGGGTG AAGCGCGCGT TACGGACACC
CGCAAGGACA AATCCAGCGG CGTCATTTGG CACGAAGGAG AGGGTGAGCT GCCGCCGGTC
GGCGCTTCCA TCCGCGGTGA GGTAGACCCG GTTTGCCGCT GGCGCAACAT GGCCCGTCAC
AGTGCCGAGC ATCTGCTCGC GCAGGCTTTT CACCGCGTCA ACCCCGCCTT CGCGGTGGCC
GCTGTCAGCC TGCGGAATGC CGAGAGCACG ATCGACCTGA CCGGTGATCC GGGCGAGGCC
GATGTGCGCG CCGCGGAGAG GTTGCTGCGG GAGACGCTCG CCCGCACCGA GCTGACGCTG
GAAACGCTGA CGGTCCCCGA GGAGGAGCTG TACCGGTACC CGCTGCGCCG CGAAGCGAAG
GTGCGGGGCG ATGTGCGCCT GGTGATCTTC CGCGACGCAG AGGGTATCCC CTTCGATGTA
AGCGCCTGTG GCGGCACCCA TGTTCCGTGT GCCAGCATGG TCGCTCCGGT TGTCGTGCTG
CGCACCGAGC GGATCCGGGG CGGCCTGACC CGCGTGTTCT TCATGGCAGG CGAGGAGGCG
AGTGCGTACC TAGCGGACGT GTACCGTGAC GCCCGTGCGT TGGCGCAGAG CTTCAGCGCC
TCGGTGTCTG ATCTGCCCGG TCGCGTGGCG GCGCTGGCCA CCGAGCGTGA CATGCTGAAG
GCGGAAGGAA CTGCTCTGCG CGCGCGGCTC GCGCGTCTCC TGGCCGATGC TGCCCCGCTG
GAGACGGTCG CAGGTGTTCC GCTGCGTCTC CTGAATCTGG AGGACCCGAA CCTCCTCTCC
GACGCGCTGG CCGCCACGCC CAAAGGTGAG GTCCGGGTGG CTCTGGCTTC CGGTGGGCGC
TGCGGCGTCG GCAGTGGCCG GGAGGACGTG TCCGCCGGGA AGCTGTTGGA GGCGGCGCTC
AGACTCACGG GTGGCAAGGG GGGCGGGCGG CCCGCGCTGG CCCAGGGCCG TACCGCTGCG
CCCGAACAAT TTGGGGAGGC TGTCCGGGAA GTTCTGCAGA CTACCCGAGC ACCCGCCTAA
 
Protein sequence
MTRPLYDTSP TRLTFTATVT DRRDGAVALD ASAFYPEGGG QNGDVGVLRW RGGEARVTDT 
RKDKSSGVIW HEGEGELPPV GASIRGEVDP VCRWRNMARH SAEHLLAQAF HRVNPAFAVA
AVSLRNAEST IDLTGDPGEA DVRAAERLLR ETLARTELTL ETLTVPEEEL YRYPLRREAK
VRGDVRLVIF RDAEGIPFDV SACGGTHVPC ASMVAPVVVL RTERIRGGLT RVFFMAGEEA
SAYLADVYRD ARALAQSFSA SVSDLPGRVA ALATERDMLK AEGTALRARL ARLLADAAPL
ETVAGVPLRL LNLEDPNLLS DALAATPKGE VRVALASGGR CGVGSGREDV SAGKLLEAAL
RLTGGKGGGR PALAQGRTAA PEQFGEAVRE VLQTTRAPA
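
The gene and protein sequences above can be checked against each other programmatically. A minimal sketch, assuming the standard amino-acid assignments (translation table 11 shares them with table 1 and differs only in the permitted start codons); `clean`, `gc_content`, and `translate` are hypothetical helper names, and the 10-character display blocks are joined before processing:

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq.split()).upper()

def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)

def translate(seq):
    """Translate complete codons; '*' marks a stop, 'X' an unrecognized codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE.get(s[i:i + 3], "X") for i in range(0, usable, 3))

# First row of the gene sequence listed above.
first_row = "ATGACCCGGC CGCTCTATGA CACGTCGCCC ACCCGCCTGA CCTTTACGGC TACCGTCACC"
print(translate(first_row))  # MTRPLYDTSPTRLTFTATVT
```

Passing the full gene sequence to `gc_content` gives a value to compare with the GC content in the record, and passing it to `translate` gives the protein sequence followed by `*` for the stop codon.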