Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tbd_0538 |
Symbol | |
ID | 3673789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiobacillus denitrificans ATCC 25259 |
Kingdom | Bacteria |
Replicon accession | NC_007404 |
Strand | - |
Start bp | 561750 |
End bp | 562736 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637709209 |
Product | capsule expression protein KpsF/GutQ |
Protein accession | YP_314296 |
Protein GI | 74316556 |
COG category | [K] Transcription [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGTCC AATCCCTCTC CGCCGAACAT TCACTCGCGC TCGCGCGTCA GGTCCTCGAG ATCGAAGCCG ACGCGCTGCA TACCGTTTCG ACACGTCTCG ATCACGGATT TGCCGACGCC GTTCGCCTGA TCCTCGCCTG CACCGGGCGG GTCGTCGTCT CCGGGATGGG CAAATCGGGG CACGTCGGTA GCAAGATCGC CGCCACCCTC GCGTCGACCG GAACGCCGGC ATTTTTCATG CATCCCGGGG AAGCGAGCCA CGGCGACCTC GGCATGATCG CGCACGACGA CGTCGTACTC GCGCTGTCCA ATTCCGGCGA AAGCAGCGAG ATCGTCTGCA TCGTGCCGCT CATCAAGCGA CGCGGCGCGA AGCTCGTCGC GATGACCGGC AACCCGGCGT CGACGCTCGC GCGCGAGGCC GATGCCCACC TCAACGCCAA GGTCGACAAG GAGGCGTGTC CGCTCAACCT CGCGCCGACC GCGAGCACGA CGGCGGCACT CGCGCTCGGA GACGCGCTCG CGGTCGCCCT GCTCGACGCA CGCGGTTTTT CGGCCGACGA TTTCGCGCGC ACCCACCCGG GCGGCAGCCT CGGGCGGCGG CTGCTCGTTC ATGTCGCCGA CGTCATGCAC GGCGGCGACG CGCTGCCGAA AGTCGGGCGC GACGCGACGC TCAAGGCCGC GCTGTTCGAA ATGACCAAAA AGGGCCTGGG CATGACCGCC GTGGTCGATG CCGACGACCG CGTCGTCGGG CTCTTCACCG ACGGCGACCT GCGTCGGACG CTCGAACATG CTCTCGACAT CCAGCACGCG AAAATCGCCG ACCTCATGAC ACCGAACCCG AAGACGATCC GCGCGGACGA ACTCGCCGCC GCGGCAGTGG AGAAAATGGA GACGCTGAAA ATCAACGGTC TGCTCGTCGT CGACGCCGAC AATCGCCTGG TCGGCGCCCT CAACATGCAC GACCTGCTGA AAGCGGGGGT GGTGTGA
|
Protein sequence | MQVQSLSAEH SLALARQVLE IEADALHTVS TRLDHGFADA VRLILACTGR VVVSGMGKSG HVGSKIAATL ASTGTPAFFM HPGEASHGDL GMIAHDDVVL ALSNSGESSE IVCIVPLIKR RGAKLVAMTG NPASTLAREA DAHLNAKVDK EACPLNLAPT ASTTAALALG DALAVALLDA RGFSADDFAR THPGGSLGRR LLVHVADVMH GGDALPKVGR DATLKAALFE MTKKGLGMTA VVDADDRVVG LFTDGDLRRT LEHALDIQHA KIADLMTPNP KTIRADELAA AAVEKMETLK INGLLVVDAD NRLVGALNMH DLLKAGVV
|
| |