Gene Dole_1564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1564 
Symbol 
ID5694401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1862587 
End bp1863807 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content64% 
IMG OID641264159 
Producttryptophan synthase subunit beta 
Protein accessionYP_001529445 
Protein GI158521575 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTCCC ATGTTTCACA CCAGACGCCG GCGACTGCCC GCCCGGACAC GACCGGTCAT 
TTTGAGCAGT ACGGCGGCAT GTACCTTGCC GAGACCCTGA TGCCGGCGGT CCTGGAACTG
GACGAAAAGC GGCGCCAGAT CATGATTGAC CCGGCGTTTC AAAAAGAGCT GGGGGGCCTG
CTGGCCGATT ACGTGGGCCG ACCCACGCCC CTGTTTTTCG CTAAACGGCT GACGGCCCAC
CTGGGCGGGG CCGCCATCTA TCTGAAGCGG GAGGACCTGG CCCATACCGG GGCTCACAAG
ATCAACAACA CCATCGGCCA GGCGTTGCTT GCCAAGTGGA TGGGTAAAAA CCGGGTGATC
GCCGAGACCG GGGCCGGCCA GCACGGCGTT GCCACGGCCA CGGCCGCGGC CCTGCTGGAC
ATGACCTGTG AAGTCTTTAT GGGGGTTGAG GATATCCAGC GCCAGGCCCC GAACGTGATG
CGGATGAAGC TGCTGGGCGC CACGGTGACA CCGGTGGACT CGGGTTCCGG CACGTTGAAG
GACGCCATGA ACGAGGCCCT GCGCCACTGG GTGGCCCGGG TGCGGGACAC CTTTTACGTG
ATCGGGTCCG TGGCCGGGCC CCATCCCTAC CCGGTGATGG TCCGCGACTT TCAGAGAATC
ATCGGCGATG AAACCCGGCG ACAGATACTG GAGGTCACGG GCCGGCTGCC GGACCTGCTG
GTGGCCTGCG TGGGCGGCGG CAGCAACGCC CTGGGAATTT TTTATCCGTT TCTTTCCGAC
ACCGTGGAGA TGGTGGGCGT GGAGGCGGGC GGCGAAGGCC TTGACACCAA TCGCCACGCC
GCCACCCTGA ACCGGGGGGT GACCGGCGTG CTGCACGGCT CAAAGTCCTA TGTGCTTCAG
GACCGGTTCG GCCAGATCGC GCCGGTGCAC TCGGTTTCCG CCGGCCTGGA CTATCCGGGC
GTGGGGCCGG AACACGCTTT TTTAAAGGAC ACGGGCCGGG TCAGATACAC GGCCATCGAC
GATAAAGAGG CCATGGCCGC CTTTCACCTG CTCTGCCGTA CCGAGGGCAT CATTCCGGCC
CTGGAAAGCT CCCATGCCGT TGCCTGCGTC ATCAAGGAAG CGCCCGGGCG GCCCAAAACA
GACATTCTCA TCGTCAACCT CTCCGGAAGG GGAGACAAAG ACCTGGGGAT CGTATCATCC
GTCATGGAAA AGGAGAAATA G
 
Protein sequence
MRSHVSHQTP ATARPDTTGH FEQYGGMYLA ETLMPAVLEL DEKRRQIMID PAFQKELGGL 
LADYVGRPTP LFFAKRLTAH LGGAAIYLKR EDLAHTGAHK INNTIGQALL AKWMGKNRVI
AETGAGQHGV ATATAAALLD MTCEVFMGVE DIQRQAPNVM RMKLLGATVT PVDSGSGTLK
DAMNEALRHW VARVRDTFYV IGSVAGPHPY PVMVRDFQRI IGDETRRQIL EVTGRLPDLL
VACVGGGSNA LGIFYPFLSD TVEMVGVEAG GEGLDTNRHA ATLNRGVTGV LHGSKSYVLQ
DRFGQIAPVH SVSAGLDYPG VGPEHAFLKD TGRVRYTAID DKEAMAAFHL LCRTEGIIPA
LESSHAVACV IKEAPGRPKT DILIVNLSGR GDKDLGIVSS VMEKEK