Gene Information |
Locus tag | Tfu_2833 |
Symbol | |
ID | 3581827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | + |
Start bp | 3328034 |
End bp | 3330280 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637686558 |
Product | putative secreted protein |
Protein accession | YP_290889 |
Protein GI | 72163232 |
COG category | [S] Function unknown |
COG ID | [COG2268] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
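The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the reported protein length excludes the stop codon; both assumptions fit the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(3328034, 3330280)   # 2247, matches "Gene Length"
length_aa = protein_length(length_bp)       # 748, matches "Protein Length"
print(length_bp, length_aa)
```

The final subtraction accounts for the stop codon, which occupies three bases of the gene but contributes no residue to the protein.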
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACACTA TTGCCACGGG GGCCGGTGTA CTGATCGCTG TCCTCCTCCT CGTCCTCCTC GGACTGGTCA TCATCGTCAG CCGCCTGTTC CGCAAAGTGG AACAGGGCAA GGCCCTGATC ATCTCCAAGG TCCGCAAGGT CGACGTCACC TTCACCGGCG CCATCGTGCT CCCCGTGCTG CACAAAGCCG AGGTGATGGA CATCTCGGTG AAGACCATCG AGATCTCCCG CATGGGGAAA GACGGCCTGA TCTGCCGGGA CAACATCCGC GCTGATATCC GCATCACGTT CTTCGTGCGG GTCAACAAGA CCGTCGAAGA CGTCATCAAG GTCGCCCAAG CGATCGGCGC GGAACGCGCC AGCGACCAGG CCACGCTGCA GGAACTGTTC AACGCCAAGT TCTCCGAGGC CTTGAAGACC GTGGGCAAGC ACTTGGACTT CGAGGACCTC TACACCAAGC GCGCCGAGTT CCGTGACCGG ATCATCAGCG TGATCGGCAC CGACCTCAAC GGCTACATCC TCGAGGATGT GGCGATCGAC CACCTGGAGC AGACTCCCCT GTCCCAGCTC GATCCCCACA ACATCCTCGA CGCACAAGGC ATCCGCAAGA TCACCGAGCT GACCACCGAG CAGAACATCC GCACCAACGA GCACCGCCGC AACGAGGAGA AGGAAATCGA GCGCCAGAAC GTCGAGGCGC GCGAAGCCAT CCTCAAACTG GGACGCCGCC GCACGGAGGC TGAGACCAGG CAGAAGCGCG AAATCGAGAC GATGCGCGCC CAGGCCGAAG CCGAAACCGA GATCGTGCGG GCCGAGGAGT TGCTGCGCGC CGAGATGGCG CGGATCCGCA CCGAACAGGA GCTCGGTGTC CAGCGGGAAA ACCGGGAGCG GGAGATCGCG GTCGCCGCGA AGAACCGCGA ACGGGTCCTC GCTATCGAAA CGGAACGCAT CGAAAAGGAC CGGCTCCTGG AAGTCATTGC CCGGGAACGC GAAACCGAAC TGTCCCGCTA CGCCAAGGAC AAGGAGCTGG AAGTCGAACG GCGCCAGATC GCCGAAGTCG TCCGCGAACG CGTCGCCGTA GAGAGGACCG TCGCCGAACA AGAGGAGAGC ATCAAGCGGC TGCGCGCGGT TGAGGAGGCG GAGCGCCACC GGCAGACCGC GGTCATTCTC GCTGAGAGCG AGGCCCAGGA GAAGCTCGTC AAGGACATCA AGGCCGCGGA GGCCGCGGCG CAAGCCGCTG AGCACACGGC GCGCAAGCAG CTCACTCTCG CAGAGGCCCG CCAGCGGACC GCGGAGCTGG AGGCTGACGC CACGATCCGC CTGGCGGAAG GCACCCAGGC TCAGGCCGCG GCCGAAGGGT TGGCCCAGGT GCAGGTGAGC GAGCGCGAAG CCGAAGCGTT GGAGAAGCTG GGCCGGGCCG AGGCTGCGGT GGCCCGCGAA AAGGCGCTGG CCCGGGCCGA AGAGATCGAG CGGATCGCCC AGGCCGAAGC CGCTGCCGAC CGGCAGAAAG CCCTCGCTCG CGCCGAAGAA ATCGAAAAGG TCGCCCAAGC CGAGGCCACT GCGGACCGCC AGAAAGCCCT GGCCCACGCC GAAAAGATCG AACAGATCGG CCAGGCCGAG GCCACTGCCG ACCGCCAGAA AGCCCTCGCC GCGGCCGAAG GGGCCCGGGA GAAGCTCAAG GCCGATGCCG AAGGGGTCCG TGAGAAGCTC AAGGCGGAAG CCGAAGGGAT CCACGACAAG GCCGAAGCGA TGGCCGCGCT CAACGAAGCA ACCCGGGAAC ACGAGGAGTA CCGCCTGCGG 
CTGGAAGCCG AGAAGGAAGT CCGGCTCGCC GGGATCGACG CGCAGCGGCA GATTGCCGAA GCCCAGGCCA CGGTGCTCGG CAAGGGCTTG GAGAACGCCG ACATCGACAT TGTCGGCGGC GACGGCGCGT TCTTCGACCG GGTCATCAAC GCGGTCAGCG CAGGCAAGGC CGTGGACAGC TTCATGGACC ATTCAGCGAC CGCGCGCACC CTGGCCGGTC CGTGGCTGGA CGGCTCCCGG GACCTGCCCG CCGATCTGAC CCGCGTGTTG AGCGGGATCG GTTCGGAGGA CGTGAAGAAC CTGTCCTTGT CGGCACTGCT CCTCACGCTG ATCAACGCGG GCGGTCCGCA GGCGAAGCAA CTGTCCGCCC TGCTGGAGTC CGCCCGGAAA CTGGGGGTGG CTGACCTGCC GCTCGCCGAC CTCACCAGCC CTGTACCGGA GAAGTGA
|
Protein sequence | MDTIATGAGV LIAVLLLVLL GLVIIVSRLF RKVEQGKALI ISKVRKVDVT FTGAIVLPVL HKAEVMDISV KTIEISRMGK DGLICRDNIR ADIRITFFVR VNKTVEDVIK VAQAIGAERA SDQATLQELF NAKFSEALKT VGKHLDFEDL YTKRAEFRDR IISVIGTDLN GYILEDVAID HLEQTPLSQL DPHNILDAQG IRKITELTTE QNIRTNEHRR NEEKEIERQN VEAREAILKL GRRRTEAETR QKREIETMRA QAEAETEIVR AEELLRAEMA RIRTEQELGV QRENREREIA VAAKNRERVL AIETERIEKD RLLEVIARER ETELSRYAKD KELEVERRQI AEVVRERVAV ERTVAEQEES IKRLRAVEEA ERHRQTAVIL AESEAQEKLV KDIKAAEAAA QAAEHTARKQ LTLAEARQRT AELEADATIR LAEGTQAQAA AEGLAQVQVS EREAEALEKL GRAEAAVARE KALARAEEIE RIAQAEAAAD RQKALARAEE IEKVAQAEAT ADRQKALAHA EKIEQIGQAE ATADRQKALA AAEGAREKLK ADAEGVREKL KAEAEGIHDK AEAMAALNEA TREHEEYRLR LEAEKEVRLA GIDAQRQIAE AQATVLGKGL ENADIDIVGG DGAFFDRVIN AVSAGKAVDS FMDHSATART LAGPWLDGSR DLPADLTRVL SGIGSEDVKN LSLSALLLTL INAGGPQAKQ LSALLESARK LGVADLPLAD LTSPVPEK
|
| |
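The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not the gene-wide GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three 10-base blocks of the gene sequence shown above.
sample = clean_sequence("ATGGACACTA TTGCCACGGG GGCCGGTGTA")
print(len(sample))          # 30
print(gc_fraction(sample))  # 0.6
print(sample[:3])           # ATG, the start codon
```

Applying the same two functions to the full gene sequence, pasted in as a single string, would allow a comparison with the GC content field of the record.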