Gene Dole_2074 details


Gene Information

Locus tag: Dole_2074
Symbol:
ID: 5694917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2525989
End bp: 2527365
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 59%
IMG OID: 641264675
Product: tryptophan synthase subunit beta
Protein accession: YP_001529955
Protein GI: 158522085
COG category: [R] General function prediction only
COG ID: [COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB)
TIGRFAM ID: [TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme
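
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 2525989
end_bp = 2527365
gene_length_bp = 1377
protein_length_aa = 458

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not
# contribute a residue to the protein.
codons = span_bp // 3
residues = codons - 1

print(span_bp, codons, residues)  # 1377 459 458
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.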


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTTCCC GGAAAATTTT TCTGACAGAA GATGAGATGC CGCGCCAGTG GTACAACATT 
CTGGCGGACA TCAAGATGAA TCCGCCGCTG GGACCCGACG GCAATCCCGT GGGACCCGAT
TCCCTGGCCC CTGTGTTCCC CATGAACCTT ATCGAGCAGG AGGTCAGCAC CGAGCGGTGG
ATCACCATTC CTGACGAGGT ACTGGACATT CTGACGACGT GGCGGCCGTC CCCCCTGGTG
CGGGCCCGCA ATCTTGAAAA GGCGCTGGGA ACGCCGGCCA AGATTTACTA CAAGAACGAA
AGCGTCAGCC CTGCCGGAAG CCATAAGCCC AACACCGCTG TCGCCCAAGC CTACTACAAC
AAGGAGTTCG GCATCAAGAA GCTCACCACC GAAACCGGGG CCGGCCAGTG GGGCAGCGCC
CTCTCCTATG CCTGTTCCCA GTTCGGCCTG GAGTGCAAGA TATTCATGGT ACGGATCAGC
TTCGACCAGA AGCCCTACCG CAAAAGCATG ATGGGCGCCT GGGGTGGTAA CTGTATTCCC
AGCCCCAGCG ACCAGACCCG TGCGGGCCGC GACGCCCTGG CCAAAGATCC CAACACCCCG
GGCAGCCTGG GCATTGCCAT CAGCGAAGCC ATCGAATGTG CCGTCACCGA CGAATCGGGA
GAGACCCGTT ATGCCCTGGG CAGCGTGCTC AACCACGTGA TGCTGCACCA GACCATCATC
GGGCTGGAAG CCAGGAAACA GTTTGAAAAA GTCGGCGACT ATCCGGATGT CATCATCGGA
TGCGCCGGCG GCGGCAGCAA CTTTGCCGGC ATCGCTTTTC CCTTTGTCTA CGACAAGATT
CACGGCAAAG ATATTGAGAT TTACCCGGTG GAGCCCATGG GCTGCCCCAC CATGACCAAG
GCCCCCTTTG TTTACGACCA CGGCGATACC GCCAAGTACA CCCCCCTGCT GGCCATGCAC
AGCCTGGGTC ATGCCTTTGT TCCGCCGCCT TTTCACGCGG GCGGGCTCCG TTACCACGGC
ATGGCGCCCA CGGTCAGCCA GCTGGTCTGC GAAGGCATTG TTACCCCCCG GGCGGTTTCC
CAGTTGAGCA CCTTTGAGGC GGGCGTGCTG TTTGCCCGTT CCGAAGGTAT CATTCCCGCG
CCCGAGAGCA ATCACGCCAT CGCCTGTGTC ATTGAAGAGG CCAACAAGGC AAAGGAAGAG
GGCAAGGAAA AGGTGATCCT GTTCAACCTG AGCGGTCATG GCCTTCTGGA CCTGGCCGGA
TACGACCGGT TCTTTGCCGG CGAGCTGTCC AACATTCTCA TGAACGATGA TGATCTGAAG
GCGTCGGAAG CGGTGTTTGC CGATTATCCC AAGCCTGCGA TCCTCAAGCA CGATTAG
 
Protein sequence
MTSRKIFLTE DEMPRQWYNI LADIKMNPPL GPDGNPVGPD SLAPVFPMNL IEQEVSTERW 
ITIPDEVLDI LTTWRPSPLV RARNLEKALG TPAKIYYKNE SVSPAGSHKP NTAVAQAYYN
KEFGIKKLTT ETGAGQWGSA LSYACSQFGL ECKIFMVRIS FDQKPYRKSM MGAWGGNCIP
SPSDQTRAGR DALAKDPNTP GSLGIAISEA IECAVTDESG ETRYALGSVL NHVMLHQTII
GLEARKQFEK VGDYPDVIIG CAGGGSNFAG IAFPFVYDKI HGKDIEIYPV EPMGCPTMTK
APFVYDHGDT AKYTPLLAMH SLGHAFVPPP FHAGGLRYHG MAPTVSQLVC EGIVTPRAVS
QLSTFEAGVL FARSEGIIPA PESNHAIACV IEEANKAKEE GKEKVILFNL SGHGLLDLAG
YDRFFAGELS NILMNDDDLK ASEAVFADYP KPAILKHD
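
The sequences above are displayed in space-separated groups of ten characters. The following sketch, which is not part of the database record, shows one way to strip that grouping, compute a GC fraction, and translate the nucleotide sequence for comparison with the protein sequence. The function names are hypothetical. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ only in the permitted start codons). Only the first line of the gene sequence is used here, to keep the example short.

```python
def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# Codon table in the conventional T, C, A, G ordering.
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = clean(
    "ATGACTTCCC GGAAAATTTT TCTGACAGAA GATGAGATGC CGCGCCAGTG GTACAACATT"
)
print(len(first_line))        # 60
print(translate(first_line))  # MTSRKIFLTEDEMPRQWYNI
print(round(gc_fraction(first_line), 2))
```

The translated fragment agrees with the first twenty residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same comparison against the listed GC content and the complete protein.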