Gene Tbd_1914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_1914 
Symbol 
ID3674024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp2009757 
End bp2010956 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content67% 
IMG OID637710613 
Producttryptophan synthase subunit beta 
Protein accessionYP_315672 
Protein GI74317932 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.148244 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.536089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTGA CTGACCTTCC CGATTCCCGC GGCCACTTCG GTCCCTACGG CGGCATTTTC 
GTTTCCGAAA CGCTGATGGC CGCGCTCGAC GCGCTGCGTG TCGAATACGA CGCGGCGTGC
CGGGACCCCG GCTTCATGGC CGAATTCGAG TACGAACTCA AGCACTACGT CGGGCGGCCG
AGCCCGGTCT ACCACGCCCG GCGGCTTTCG GAGGAATACG GCGGCGCGCA GATCTACCTC
AAGCGCGAGG ATCTCAACCA CACCGGCGCG CACAAGATCA ACAACACGAT CGGCCAGGCG
CTGCTCGCGC GCCGCATGGG CAAGAAGCGC GTCATCGCCG AGACCGGCGC GGGCCAGCAC
GGCGTCGCGT CGGCGACCGT CGCCGCGCGC TACGGCATGG AATGCGTTGT CTACATGGGC
GCCGAAGACG TCGCGCGACA GGCCCCCAAC GTCTTTCGTA TGAAGCTCCT CGGCGCGACC
GTCGTGCCCG TGTCGTCGGG TTCGAAGACG CTGAAGGACG CGCTGAACGA AGCGATGCGC
GACTGGGTGA CGAACGTCGA GTCGACCTTC TACATCCTCG GCACCGCGGC CGGCCCGCAT
CCCTACCCGA TGCTCGTGCG CGACTTCCAG TGCGTGATCG GGCGCGAATG CATCGCGCAG
ATGCCCGAGC TCGTCGGACG CCAGCCCGAC GCGGTCGTCG CCTGCGTCGG CGGCGGCTCG
AACGCGATCG GAATTTTCCA TCCCTACATT CCCCATGAGA ACGTGCGCCT GATCGGTGTC
GAAGCCGGCG GTTCGGGGGT CGCGAGCGGC AAGCACGCTG CGCCGCTGAC CGCCGGCACG
CCCGGGGTGT TGCACGGCTT TCGCAGCTAC CTGATGCAGG ACGAGAACGG CCAGATCATC
GAGACCCATT CGGTCTCGGC CGGCCTCGAC TATCCGGGCG TCGGCCCCGA GCACAGCTAT
CTCAAGGACG CCGGTCGCGC CGAATACGTG CCGATCAACG ACGACGAAGC GCTCGCCGCC
TTCCACGATC TGTGCCGCTT CGAGGGCATC ATCCCCGCGC TCGAGTCGAG CCACGCGGTG
GCGCAGGCGA AGAAACTCGC GCCGACGATG AAGAAGGACC AGGTCATTCT GGTGAACCTC
TCGGGGCGCG GCGACAAGGA CATCAACACC GTGGCGAAGG CGGCGGGCAT CACGCTCTGA
 
Protein sequence
MKLTDLPDSR GHFGPYGGIF VSETLMAALD ALRVEYDAAC RDPGFMAEFE YELKHYVGRP 
SPVYHARRLS EEYGGAQIYL KREDLNHTGA HKINNTIGQA LLARRMGKKR VIAETGAGQH
GVASATVAAR YGMECVVYMG AEDVARQAPN VFRMKLLGAT VVPVSSGSKT LKDALNEAMR
DWVTNVESTF YILGTAAGPH PYPMLVRDFQ CVIGRECIAQ MPELVGRQPD AVVACVGGGS
NAIGIFHPYI PHENVRLIGV EAGGSGVASG KHAAPLTAGT PGVLHGFRSY LMQDENGQII
ETHSVSAGLD YPGVGPEHSY LKDAGRAEYV PINDDEALAA FHDLCRFEGI IPALESSHAV
AQAKKLAPTM KKDQVILVNL SGRGDKDINT VAKAAGITL