Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tbd_0195 |
Symbol | |
ID | 3673641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiobacillus denitrificans ATCC 25259 |
Kingdom | Bacteria |
Replicon accession | NC_007404 |
Strand | - |
Start bp | 206569 |
End bp | 209364 |
Gene Length | 2796 bp |
Protein Length | 931 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637708856 |
Product | TPR repeat-containing protein |
Protein accession | YP_313953 |
Protein GI | 74316213 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCGAC ATCCCGCTCC CCTGATTGCG TCGCTGCTGG TCCTGGCCTT TTGCGCGCCG CTCGCCGGCT GCGATCCCAC GGCCGGACTC AGCGCCCAGG AACACGTCCA GCGCGCCAAG GACTTCGAGG ACAAGGGCGA CCTCAAGGGC AGCGTCATCG AACTGAAGAA CGCGATCCAG AAGAATCCCG ACAGCGCCGA AGCCCGGCTC CTGCTGGGAC AGGTCTACCT CAAGGCGGGC TTCGGCGCCG AAGCGGAAAA GGAACTTCGA CAAGCCGAGC GGCTCGGCGT CGGCCGCGCC ACCCTCGAGC CCCTGCTCGG GGAAGCCCTG CTGCTGATGG GCGAGTACGC GCGCGTCCTC GACGAAATCC AGCCGGACAC GCAGGGACCG AAGGAGCGCC TGTCGCGCAT CCTGCAACTG CGTGGCGAGG CCCTGCTGAA CCAGCGGAAA CTGGAGGAGG CGTGCAATCT GTTCCAGCAG TCCTACGATG CCTCGCCCGG CAACCCGCCC ACCTACTGGG GCCTTTCGCG CTGCGCGCTG GCAACGGGTG ACGCGGCGAA GGCGCGCGAC TGGCTTGAAC GCGCGCTCAA GCTCGAGCAC AAGCGCGCCC GCACCTGGAT TCACCTCGGC AACCTCGAAT TGGCCGGCAA GGATACGGCG AAGGCGCTCG CTGCCTATTC GAAGGCTGTG AAGATCGAAC CGAACAATCT GGATGCGCTG CAGAGTCTGG TCGCGATTCA CGTCAAGGCG GGAGACACCC AGCGCGCGCG CGAGTACTTG GCCGTGATCA GGAAGCTCGC GCCCAAATCG ACCCGCGCAC ATTACCTCGA GGCGTCGATC GCCTACAGCG AGAAGAAATT CGCCGAGGCA AACGCCGCGA TTCAGGAAGC CCTGAAAGTC TCGCCCGACC ATGTTCCGAG CCTGATGCTC GCCGGCATGA GCGCCCATGC GCTCGGCTCC TACCAGGAGG CGGAAACGTA TTTCAAGCGC TTTCTGCTGC GGGTTCCCGG CCACGCGGAA GGGCTCAAGA TGCTTGCGAC GACGCAAATC AAGTCGAAGC AATTCGACAA GGCGCTCGTC ACGCTCGCCC CTTTCCTCGC CCCCGGGGTG CGGGATGCAC AGGGTCTGGC GCTGGCGGGC GAAGCGCAGA TGGCCAACGG CAACCCGAGC CAGGCCGCGG CGCTCTTCGA ACGCGCGCTC GCGCTCGAAC CCGGCAACGT CACGATACGT ACGCAGCTCG GCCTGAGTCA GCTCGCCGCC GGGAACACGC AAGACGCCAT CGACGAGTTG ACCGATGCCT CACAGCATTC TTCGGGCTCC CAGGCGGACA CGCTGCTTGC GGTCGCCTAT CTGAGCCGCA AGGATTACGA CCGCGCACTC GCCGCGCTTG CGACCCTACA GAAAAAAGGC GACGCCAGCG CGAAAATCCA TCACCTGGCC GGGCAGGCCT ACCTCGGCAA GAACGACAAG CTTGCCGCCC GCCGTAATTT CGAACAGGCG CTCGCCGCCG ACGCGGCGTT CTTCCCCGCG GTCGCCAGCC TCGCGCAGCT CGACGTGGCC GAGAACAAGG CGGACGCGGC CCGCATGCGT CTCGAGCGCG CGCTCGCCCA GGACAAGAAC CGGGTCGCCG CGATGCTCGC GCTCTCGCGG ATGGCTGCCC GCAATGGTCA GGAGCAGGCC TCGATCGACT GGCTCGAGAA AGCCGCCCGC GCCGACGGCA AGGCGATACA GCCGCGCATC GAACTGGTAC GGCATTACCT GGCCCGCAAC GAGGGCCAGA AGGCGCTCGC CCTGGCCAAC GAGGCGGTCC GCGCCAACCC CGACCACCCT GCCGCGCTCA ACCTGCTCGG CACGGTGCAA CTCGCGCTCG ACGACAAGGC GAGTTCGGCG AGCACCTTCA GCCGGCTTAC CCGGGAGACT AGGCAGTCGC CGGAAGGCTT CGTGCGTCTC GCGCAGGTGC AGCTGGCCGA CGGTAAACTC GACGAAGCGC GCCGCAACCT GCTGCACGCG CTGGAACTCG CGCCGGGACA TCTCAAAAGC CAGGAGGCAT TGATCAAGCT GGAACTCGCC GCCAAGCGCC CCGAGGCCGC GCTTCTCGTC GCGCGCGACA TCCAGAAGGG CCACCCCGAT TCCGCCGTCG GCTTCGTACG CGAAGGCGAC ATTCTGCTCG CCGAAAAACG CATCGCGCAG GCTGTGCCGG CCTACGTCCG CGCGCTGGAA CATGGCGCCG GGCCGGCTGT GCTGGTCCAG TTCCACCGAG CGACCGTCCT CTCGGGGCAG AACCGTGCGG CGGCCGACCG CCGGCTCGAG GACTGGATCC GGCAGCATCC GAAGGACAGC GGCGTCGCCG CATACGCAGC CGGGTACTAC CTTGTCACCG GGCAAAGCGC GCGCGCTGCG GAGACTTATC GGCAGATCCT GAAGCACGAA CCACGCAACG TCATGATCCT GAACAATCTC GCCAGCCTCT ATCTGCAGCA GCGAGACCCG CGCGCGCTCG AGCTCGCGAC CCAGGCCAAC CGACTCGCGC CGACCAACCC GGCCGTCCAG GACACCCTGG GCTGGGTTCT GGTCGAACAA GGCCAGGCCC GGCGCGGACT CGGGTACCTG CGCAAGGCGA TGGCCCAGAC ACCGAAGAAC GCGAGCCTGC GCTACCACCA CGCGGTGGCG CTCGCCCGCA CCGGAGACCG CCCAGGTGCG CGCAAGCTGC TCGAGCAGCT GCTCGCCGAA ACGCCGCGCT TCGAGGAACG CGCTGCGGCG GAGACCCTAC TCAAGAGCCT GCCGGCTGCT TCCTGA
|
Protein sequence | MTRHPAPLIA SLLVLAFCAP LAGCDPTAGL SAQEHVQRAK DFEDKGDLKG SVIELKNAIQ KNPDSAEARL LLGQVYLKAG FGAEAEKELR QAERLGVGRA TLEPLLGEAL LLMGEYARVL DEIQPDTQGP KERLSRILQL RGEALLNQRK LEEACNLFQQ SYDASPGNPP TYWGLSRCAL ATGDAAKARD WLERALKLEH KRARTWIHLG NLELAGKDTA KALAAYSKAV KIEPNNLDAL QSLVAIHVKA GDTQRAREYL AVIRKLAPKS TRAHYLEASI AYSEKKFAEA NAAIQEALKV SPDHVPSLML AGMSAHALGS YQEAETYFKR FLLRVPGHAE GLKMLATTQI KSKQFDKALV TLAPFLAPGV RDAQGLALAG EAQMANGNPS QAAALFERAL ALEPGNVTIR TQLGLSQLAA GNTQDAIDEL TDASQHSSGS QADTLLAVAY LSRKDYDRAL AALATLQKKG DASAKIHHLA GQAYLGKNDK LAARRNFEQA LAADAAFFPA VASLAQLDVA ENKADAARMR LERALAQDKN RVAAMLALSR MAARNGQEQA SIDWLEKAAR ADGKAIQPRI ELVRHYLARN EGQKALALAN EAVRANPDHP AALNLLGTVQ LALDDKASSA STFSRLTRET RQSPEGFVRL AQVQLADGKL DEARRNLLHA LELAPGHLKS QEALIKLELA AKRPEAALLV ARDIQKGHPD SAVGFVREGD ILLAEKRIAQ AVPAYVRALE HGAGPAVLVQ FHRATVLSGQ NRAAADRRLE DWIRQHPKDS GVAAYAAGYY LVTGQSARAA ETYRQILKHE PRNVMILNNL ASLYLQQRDP RALELATQAN RLAPTNPAVQ DTLGWVLVEQ GQARRGLGYL RKAMAQTPKN ASLRYHHAVA LARTGDRPGA RKLLEQLLAE TPRFEERAAA ETLLKSLPAA S
|
| |