Gene Daro_4123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4123 
Symbol 
ID3566701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4422278 
End bp4423810 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content61% 
IMG OID637682595 
Productthreonine dehydratase 
Protein accessionYP_287319 
Protein GI71909732 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01124] threonine ammonia-lyase, biosynthetic, long form
[TIGR01127] threonine dehydratase, medium form 


Plasmid Coverage information

Num covering plasmid clones65 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCCGG ATTATCTCGA AAAAGTTCTC AACGCACAGG TTTACGACGT GGCGATCGAA 
ACGCCGCTCG ACCTGGCCAG CAACCTTTCC GCCCGGGTCG GCAACAAGAT TATCCTGAAG
CGCGAAGACA TGCAGCCTGT CTTTTCCTTC AAGCTGCGTG GCGCCTACAA CAAGATTGCC
AGCCTGTCGG CCGAAAAGCT GAAGCGCGGT GTCATCTGTG CCTCGGCCGG CAACCACGCC
CAAGGGGTCG CCCTGTCGGC CAGCAAACTG GGTTGCCGGG CGGTCATCGT CATGCCGACC
TCGACGCCGG GCATCAAGAT TAACGCCGTT AAATCGCGGG GCGGCGAAGT GGTGCTGCAT
GGCGACTCCT TCGACGAAGC ATACGCCCAT GCGGTCGAAC TGGAAAAAAC CGAAAAGCTG
ACCTTCGTTC ATCCCTTCGA CGATCCGGAG GTGATCGCCG GACAGGGCAC GGTCGCCATG
GAAATCCTGC GCCAGCATTC GCGCCACAAC GGCCCGATCA CCGCCGTCTT CTGTGCCATC
GGCGGCGGCG GGCTGGCTGC TGGCGTGGCC GCCTACATCA AGCGCCTGCG TCCGGAAATC
AAGGTGATCG GTGTCGAAAC CTTCGACGCC GACGCCATGA AGCAGTCGCT GGCCGCCGGC
AAGCGCGTTC GCGTCGATCA GGTCGGGTTG TTCTCCGACG GCACCGCCGT CAAGCTGGTC
GGCGAAGAAA CCTTCCGGCT GTGCAAGGAA TACCTGGACG AGGTCATCCT GGTCGACACC
GATGCCATCT GTGCCGCCAT CAAGGATGTT TTCGAGGATA CCCGCTCGAT TCTGGAACCG
GCCGGGGCGC TGGCCGTCGC CGGCGCCAAG GAATATGCCC GTCAGCACAA ACTGAAGGAC
AAGAACCTGA TCGCCATCAC CTCCGGCGCC AACATGAATT TCGACCGCCT GCGCTTCGTT
GCCGAGCGGG CCGAATTCGG CGAACAGCGT GAAGCCGTCT TCGCGGTCAC GCTGCCCGAG
AAGCCGGGTG CCTACAAGAA ATTCCTTGGG CTGATCGGTC ACCGCAATGT GACCGAATTC
AACTACCGCT TCCACACCGC CAGCGAAGCT CACGTCTTCG TCGGCGTTCA GGTGGCCGAC
CGCAAGGAAT CGCTGAAGCT GGTCGACAGC CTGCAGAAAC ACGGTTACCC GACGCTCGAC
CTGACCGATG ACGAAATGGC CAAGAACCAT ATCCGTCACA TGGTCGGCGG CCATGCCCCG
GCGGTTTGCG AAAAGGGCAT GCGCGAGTTG CTCTATCGCT TCGAATTCCC GGAAAAGCCG
GGCGCCCTGA TGAACTTCCT GACCCAGATG AGCGCCGGCT GGAACATCAG CCTGTTCCAC
TACCGCAACC ATGGCGCCGA CTACGGTCGC GTCCTGGTCG GCATGCAGGT ACCGCCGGAC
GACATGGGGA AATTCAAGGA ATTCCTGACC AATCTCGGCT ACGCCCACTG GGACGAGAGC
CAAAACCCGG CATACAAGCT GTTCCTGGGT TAA
 
Protein sequence
MHPDYLEKVL NAQVYDVAIE TPLDLASNLS ARVGNKIILK REDMQPVFSF KLRGAYNKIA 
SLSAEKLKRG VICASAGNHA QGVALSASKL GCRAVIVMPT STPGIKINAV KSRGGEVVLH
GDSFDEAYAH AVELEKTEKL TFVHPFDDPE VIAGQGTVAM EILRQHSRHN GPITAVFCAI
GGGGLAAGVA AYIKRLRPEI KVIGVETFDA DAMKQSLAAG KRVRVDQVGL FSDGTAVKLV
GEETFRLCKE YLDEVILVDT DAICAAIKDV FEDTRSILEP AGALAVAGAK EYARQHKLKD
KNLIAITSGA NMNFDRLRFV AERAEFGEQR EAVFAVTLPE KPGAYKKFLG LIGHRNVTEF
NYRFHTASEA HVFVGVQVAD RKESLKLVDS LQKHGYPTLD LTDDEMAKNH IRHMVGGHAP
AVCEKGMREL LYRFEFPEKP GALMNFLTQM SAGWNISLFH YRNHGADYGR VLVGMQVPPD
DMGKFKEFLT NLGYAHWDES QNPAYKLFLG