Gene Dret_1099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1099 
Symbol 
ID8418924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1288785 
End bp1290242 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content59% 
IMG OID645037671 
Productthreonine synthase 
Protein accessionYP_003197965 
Protein GI258405223 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.113278 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATGT CGATTCATGA TTTCCCTGCC TATCGCGGCG TCCTTGAGTA TGTCTGCCTG 
CAGTGCCAGG CGCGCTATCC GCAGGATGAA TTGCTCTATA CCTGCCCGGA TTGCGGCGGG
GTGTTTTTGC TTGAAGACAG CCGACAGGAG AGCCTGAAAG AGCTTTCCGG GGCGCAGTGG
CGGGAACGCT TTGACGCCCG GGCTGCTGTC AAACGCAGAG CGCTGCGGGG GATTTTTCGA
TTTTACGAAC TGATCGCCCC GGTGCTTGAC GAGCAGGATA TCGTCTATCT CGGCGAAGGG
CAGACGCCGA TCATACCGTC CAGCCCGGCC TTGAACACGG CGGTGGGACA CACCGTGGCC
ATGAAAAACG ATGGTCAGAA TCCTTCAGCC TCTTTCAAGG ATCGGGGCAT GGCTTGTGCC
TTCAGCTATC TGCAATCCAT GGCCCGGGCC AATGACTGGG ATCAGCTGTT GACCATCTGC
GCCTCGACCG GCGATACCTC GGCCGCAGCC GCCTTGTACG CCGCCTATGT CGGGGCGCCG
CTGACATCAG TGGTCCTCTT GCCCGCGGGC AAGGTCACCG AACAGCAGTT GGCGCAGCCT
TTGGGCAGCG GAGCAGTGGT CCTTGAGGTG CCCGGCGTAT TTGACGATTG TATGCGGGTC
GTGGAATATC TTGCTGATAA CTATCGAGTG GCCCTGCTGA ATTCCAAGAA TCCGTGGCGC
ATTCTGGGCC AGGAATCCTA CGCCTACGAA GTCGCCCAAT GGTATGATTG GAACCTGAGC
GACAAGGCGC TCTTTGTGCC CATTGGCAAT GCCGGCAATA TTACGGCCAT CATGTCCGGG
CTGCTGAAGA TGCACGACCT GGGCATTATC ACCGCTCTGC CGCAGTTGTT CGGGGTCCAG
ACCGCTCACG CCGACCCGGT TTTCCAGTAT TACAGACAAC CGCCCGAGAG CCGGAGTTAC
TCCCCTGTAG AAGTCAAACC GAGCGTGGCC CAGGCGGCCA TGATCGGTAA TCCTGTTTCC
TTTCCCAGGG TGCGCACCTT GGCTGAGCGC TACGAGCAGG TGGCGGGCGA TGGCAGTTTC
CAGGTCGTCC AGGTCCAGGA ACAGGCGATC ATGGAATCCA TGTTGCTGGC CAACCGGCAC
GGGCACATCG CCTGCACCCA GGGTGGGGAA TGTCTGGCCG GGTTGCTCCG GGCCAAGGAA
GAAGGCCGTA TCGACCACAA AACGACCGCG GTCTTGGACG CGACGGCCCA CAGCCTGAAG
TTCATCGGTT TCCAGGACCG GTATTTTCAA AATACCTTTG CTCCGGAATA CGGGATCACT
CCCCAGGCCC AATGGCAGAA CAGGCCGGAG ACGGTTATCG ATCCTGAGGT CAAGAATCGT
TTGGCAGCCG ATGCCTTCGC CCGGGAGGCG GCCCAGGCTG TTGTCGAACG TCTCGGATTG
GAAGGGAAGG AGGACTGA
 
Protein sequence
MSMSIHDFPA YRGVLEYVCL QCQARYPQDE LLYTCPDCGG VFLLEDSRQE SLKELSGAQW 
RERFDARAAV KRRALRGIFR FYELIAPVLD EQDIVYLGEG QTPIIPSSPA LNTAVGHTVA
MKNDGQNPSA SFKDRGMACA FSYLQSMARA NDWDQLLTIC ASTGDTSAAA ALYAAYVGAP
LTSVVLLPAG KVTEQQLAQP LGSGAVVLEV PGVFDDCMRV VEYLADNYRV ALLNSKNPWR
ILGQESYAYE VAQWYDWNLS DKALFVPIGN AGNITAIMSG LLKMHDLGII TALPQLFGVQ
TAHADPVFQY YRQPPESRSY SPVEVKPSVA QAAMIGNPVS FPRVRTLAER YEQVAGDGSF
QVVQVQEQAI MESMLLANRH GHIACTQGGE CLAGLLRAKE EGRIDHKTTA VLDATAHSLK
FIGFQDRYFQ NTFAPEYGIT PQAQWQNRPE TVIDPEVKNR LAADAFAREA AQAVVERLGL
EGKED