Gene Tbd_0666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_0666 
Symbol 
ID3672631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp706654 
End bp707754 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content68% 
IMG OID637709338 
Productchorismate synthase 
Protein accessionYP_314424 
Protein GI74316684 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0639836 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGGCA GCACCCTCGG CAAACTGTTC TGCGTGACCG TATTCGGCGA GTCGCACGGC 
CCCGCGATCG GCTGCGTCGT CGACGGCTGC CCGCCCGGCA TGACGCTGGG CGAATCCGAC
ATCCAGCACG ATCTCGACCG GCGCAAGCCC GGCACCTCCC GCCACGTCAC GCAACGTCGC
GAATCCGACA CGGCCGAGAT TCTTTCCGGC GTCTACGAAG GCAGGACCAC CGGCACGCCG
ATCGCGCTGC TGATCCGCAA CGAGGACCAG CGCAGCAAGG ACTACGGAAA CATTGCCGCG
ACCTTCCGGC CGGGGCACGC CGACTATACC TATACGCAGA AATACGGCTT TCGCGACCCG
CGGGGAGGCG GCCGTTCGTC TGCGCGGCTG ACTGCGCCGA TCGTCGGCGC CGGCGCCATC
GCGAAGAAGT GGCTGAAGGA AAAATACGGC ATCGTGATCC GCGGCTACAT GAGCGCGCTC
GGCCCGCTCG ACATTCCCTT CGAATCCTGG GATGAAGTCG ACAACAACGC CTTCTTCTCG
CCCAACGCCG CGATCGTGCC CGAACTCGAG CAATACATGG ACGCGCTGAG AAAATCCGGC
GACTCGGTCG GTGCGCGCGT CAGCGTCGTC GCCGAGAACG TGCCGCCCGG CTGGGGCGAG
CCGCTGTACG ACAAGCTCGA CGCCGACCTC GCCCACGCGC TGATGGGCCT GAACGCCGTC
AAGGGCGTCG AGATCGGCGA CGGCATGCAG GCCGCGCGAC AGCTCGGCAC CGAGCATCGC
GACGAGATCA CCCCCGCGGG ATTTCTCTCC AACCATGCCG GCGGCGTGCT CGGCGGCATC
TCGTCGGGGC AGGCGATCGT CGCCCACGTC GCGATCAAGC CGACCTCGTC GATGCGCCTG
CCCGGGCGCT CGGTCGACCT CGATGGCCAG CCGATCGAGG TCGTCACCCA CGGCCGGCAC
GACCCCTGCG TCGGCATCCG CGCGACGCCG ATCGTCGAGG CGCTGACCGC GATCGTGCTG
ATGGACCATG CGCTGCGCCA CCGCGCGCAG TGCGGCGATG TCGCGAGCGG CGTTCCGATC
GTGCCCGCAC GGCTGGACTG A
 
Protein sequence
MSGSTLGKLF CVTVFGESHG PAIGCVVDGC PPGMTLGESD IQHDLDRRKP GTSRHVTQRR 
ESDTAEILSG VYEGRTTGTP IALLIRNEDQ RSKDYGNIAA TFRPGHADYT YTQKYGFRDP
RGGGRSSARL TAPIVGAGAI AKKWLKEKYG IVIRGYMSAL GPLDIPFESW DEVDNNAFFS
PNAAIVPELE QYMDALRKSG DSVGARVSVV AENVPPGWGE PLYDKLDADL AHALMGLNAV
KGVEIGDGMQ AARQLGTEHR DEITPAGFLS NHAGGVLGGI SSGQAIVAHV AIKPTSSMRL
PGRSVDLDGQ PIEVVTHGRH DPCVGIRATP IVEALTAIVL MDHALRHRAQ CGDVASGVPI
VPARLD