Gene Information |
Locus tag | Tbd_0666 |
Symbol | |
ID | 3672631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiobacillus denitrificans ATCC 25259 |
Kingdom | Bacteria |
Replicon accession | NC_007404 |
Strand | + |
Start bp | 706654 |
End bp | 707754 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637709338 |
Product | chorismate synthase |
Protein accession | YP_314424 |
Protein GI | 74316684 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
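The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the record uses 1-based inclusive coordinates and that the gene length includes the stop codon; both assumptions are consistent with the values shown but are not stated in the record. The function names `span_length` and `coding_codons` are hypothetical helpers introduced for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above (Tbd_0666).
start_bp, end_bp = 706654, 707754
gene_length_bp = span_length(start_bp, end_bp)
protein_length_aa = coding_codons(gene_length_bp)
print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1101 bp) and Protein Length (366 aa) fields.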
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0639836 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGGCA GCACCCTCGG CAAACTGTTC TGCGTGACCG TATTCGGCGA GTCGCACGGC CCCGCGATCG GCTGCGTCGT CGACGGCTGC CCGCCCGGCA TGACGCTGGG CGAATCCGAC ATCCAGCACG ATCTCGACCG GCGCAAGCCC GGCACCTCCC GCCACGTCAC GCAACGTCGC GAATCCGACA CGGCCGAGAT TCTTTCCGGC GTCTACGAAG GCAGGACCAC CGGCACGCCG ATCGCGCTGC TGATCCGCAA CGAGGACCAG CGCAGCAAGG ACTACGGAAA CATTGCCGCG ACCTTCCGGC CGGGGCACGC CGACTATACC TATACGCAGA AATACGGCTT TCGCGACCCG CGGGGAGGCG GCCGTTCGTC TGCGCGGCTG ACTGCGCCGA TCGTCGGCGC CGGCGCCATC GCGAAGAAGT GGCTGAAGGA AAAATACGGC ATCGTGATCC GCGGCTACAT GAGCGCGCTC GGCCCGCTCG ACATTCCCTT CGAATCCTGG GATGAAGTCG ACAACAACGC CTTCTTCTCG CCCAACGCCG CGATCGTGCC CGAACTCGAG CAATACATGG ACGCGCTGAG AAAATCCGGC GACTCGGTCG GTGCGCGCGT CAGCGTCGTC GCCGAGAACG TGCCGCCCGG CTGGGGCGAG CCGCTGTACG ACAAGCTCGA CGCCGACCTC GCCCACGCGC TGATGGGCCT GAACGCCGTC AAGGGCGTCG AGATCGGCGA CGGCATGCAG GCCGCGCGAC AGCTCGGCAC CGAGCATCGC GACGAGATCA CCCCCGCGGG ATTTCTCTCC AACCATGCCG GCGGCGTGCT CGGCGGCATC TCGTCGGGGC AGGCGATCGT CGCCCACGTC GCGATCAAGC CGACCTCGTC GATGCGCCTG CCCGGGCGCT CGGTCGACCT CGATGGCCAG CCGATCGAGG TCGTCACCCA CGGCCGGCAC GACCCCTGCG TCGGCATCCG CGCGACGCCG ATCGTCGAGG CGCTGACCGC GATCGTGCTG ATGGACCATG CGCTGCGCCA CCGCGCGCAG TGCGGCGATG TCGCGAGCGG CGTTCCGATC GTGCCCGCAC GGCTGGACTG A
|
Protein sequence | MSGSTLGKLF CVTVFGESHG PAIGCVVDGC PPGMTLGESD IQHDLDRRKP GTSRHVTQRR ESDTAEILSG VYEGRTTGTP IALLIRNEDQ RSKDYGNIAA TFRPGHADYT YTQKYGFRDP RGGGRSSARL TAPIVGAGAI AKKWLKEKYG IVIRGYMSAL GPLDIPFESW DEVDNNAFFS PNAAIVPELE QYMDALRKSG DSVGARVSVV AENVPPGWGE PLYDKLDADL AHALMGLNAV KGVEIGDGMQ AARQLGTEHR DEITPAGFLS NHAGGVLGGI SSGQAIVAHV AIKPTSSMRL PGRSVDLDGQ PIEVVTHGRH DPCVGIRATP IVEALTAIVL MDHALRHRAQ CGDVASGVPI VPARLD
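The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the gene sequence relates to the protein sequence and the GC content field, the code below strips the block spacing, computes GC percentage, and translates codons up to the first stop. It assumes the elongation codon assignments of translation table 11, which match the standard genetic code; it does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
print(translate("ATGTCTGGCA GCACCCTCGG CAAACTGTTC"))
```

Passing the full Gene sequence field to `translate` and `gc_percent` should, if the record is internally consistent, reproduce the Protein sequence field and a value near the listed GC content.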