Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tfu_0541 |
Symbol | |
ID | 3580339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | + |
Start bp | 613064 |
End bp | 616120 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637684229 |
Product | hypothetical protein |
Protein accession | YP_288602 |
Protein GI | 72160945 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCTTCC GATCGCCAGG CGCGCCCACT GCGCGCATGC CCCGACGGTC CCGGTTGCTC GCGCCGGTCG GCGCAGCCGT GGTCGTCATC ATCGCGGGCA TCATGTTCGC CGCGAATTTC TGGACCGAGT ACCGGTGGTT CGGTTCGGTC GGCTACACCA CGGTCTTCTG GACGGAGCTG CGCACCCGGG CACTGCTTTT TGCCGGGGGA GCGCTGCTGA TGGCCCTCGC CGTCGGCCTC AGCGTCTACT TCGCTTACCG GACCAGACCC GCCTACCGGC CGTTCAGCCT GGAACAGCAA GGGCTTGACC GCTACCGGTC CTCTATCGAC CCTCACCGCA AAGTCTTCTT CTGGGGGCTT GTCGGCGGGC TTGCACTGCT CACCGGGGCC TCTGCGACCG CGGAGTGGCA GACTTTCCTG CAGTTCGCCA ACGCCACGAA GTTCGGAGCC CAAGACGCCC AGTTCGGGCT GGACATCTCG TTCTACACCT TCACGTACCC GTTCCTCCAG GTGATCATCG GCTACCTGTA CACGGCCGTG GTCATCGCGT TCATCGCGGG TGTGGTGGTG CACTACCTGT ACGGCGGGGT GCGGCTGCAA GCCCAAGGGC AGCGCGTCAC CCCGGCAGCG CGCGTGCACC TGTCCGTCCT GCTGGGCGTC TTCCTGCTGC TGCGGGCTGC CGACTACTGG CTGGAGCAGT ACGGGCTGGT CTTCTCTAAC CGCGGCTACA CCTTCGGCGC GTCCTACACC GACGTCAACG CGGTCCTGTA CGCCAAGATC ATCCTGTTCT TCATCGCGCT GGTGTGCGCG GTGCTGTTCT TCGCCAACAT CTACTTCAAG AACGCCAGGG TGCCGCTGGT CAGCCTGGGA TTGATGGTGC TCTCGGCGAT CCTCATCGGC GGGGTCTACC CGGCGATCGT CCAGCAGGTC ACCGTCTCCC CCAACGAGCA GCGGCTGGAA CGGCCCTACA TCCAGCGCAA CATCGAGGCC ACCCGGGCCG CGTACGGTAT CGACGGCGCC GAGGTGATCG ACTACGACGC GCAGACCGAG CTGACCACCG CGGAGCTGGC AGCGGAGGCC GAGACCATTC CCAGCGTGCG CCTGGTGGAC CCGGCCGTGG TCTCGCAGAC TTTCCAGCAG CTCCAGCAGG TCCGCGGCTT CTACCAGTTC CCCCAGGTCC TGGAGGTCGA CCGGTACACC ACCTCGGACG GCGAGACGGT GGACACGATC GTGGCAGCCC GGGAACTCGA CGGTCCCCCC AGCCAGGAGG ACCGGTGGCT GACCCGCCAC CTGGTCTACA CCCACGGTTT CGGCATGGTC GCCGCCGCGG GCAACCAGGT GGACTCCGAG GGGCGTCCAG TGTTCCTGGA GTACAACATT CCGCCGACCG GTGAACTGTC CCAGGTGGGG GAGGGCTACG AGCCGCGCAT CTACTTCGGC CGCGAAGGCG CCGAGTACGT GATCGTCAAC GCCGAAGCCG AGTACGACTA CCCGGTGGAC CCCGACACGC CGGAGGTTCC CACCACCGAA GACGCGGTCG TTCCGACCCC TTCCCCGTCG CCGGAGGCGG ACGAAGCTCC AGCACCTGCC GACAGCCGCG AGGAAGGCTC CGCGCAAGAG CAGCAGGACC AGGAAGGCCA GGACGGCGGG GAGGACGCCC AGGGCACTGA GGAGGAGCAG TCCGGCCAGG GCAGCGGCCA GGCCAACAAC TACTACGACG GTAAAGGCGG CGTGCAGCTC AAGAGCTTCT TCGACAAGCT CATGTACGCG CTCAAGTACC AGGAGATCAA CATCCTCCTC AACAACGCGA TCAGCAACGA GTCGCAGATC ATCTACGTCC GCGACCCTGC TGAGCGGGTG GAGAAGGTCG CTCCTTTCCT CACCGTGGAC GGCAAGGCGT ACCCGGCGGT GGTGGACGGC CGGATCGTGT GGATCGTCGA CGCCTACACC ACTTCGGACC GGTACCCCTA CTCCACGCCG ATCGACCTGG CCCAAGCCAC GACGGACACC TTCACCGAGA GCACCACGGC GGTCAACGCG CTGCCGGGCA ACCGGGTGAA CTACATCCGC AACTCGGTCA AGGCCACCGT GGACGCCTAC GACGGGACCG TGACCCTGTA CGGCTGGGAC GAGGAAGACC CGGTCCTGCA GACCTGGTCC AAGGCGTTCC CTGGAGTGGT CACCAGCAAG GACGAGATCA GCGACACGCT GCTGTCCCAC CTGCGCTACC CGGACGACCT GTACAAGGTG CAGCGGGAGA TCCTGGAGCG CTACCACATC ACCAACGCCG ACGCCTTCTA CGGCGGCCAG GACTTCTGGA CCGTGCCCAA CGACCCGAAG CCGCAGGCGG GCAACAACCC GGAGCCGCCC TACCGCCAGA CCATCCGCTT CCCGGGCGAC GACACGCCGA CGTACTCGCT GACGTCGACC TTCGTGCCCC GGGGCCGGGA GAACCTGGCG GCGTTCATGG CCGTGAACAG CGACGCGTCG TCGGAGGACT ACGGCCAGAT GCGCATCCTG GAGCTGCCGC GGAGCACCGC GGTCCAAGGG CCGGGGCAGA TCCAGAACAC CTTCCAGTCC TCGGCTGAGG TGCGTGAAGT GCTGCTCCCC TTGGAGCAGA GCTCGGCCCA GGTCACCTAC GGCAACCTGC TCACCTTGCC TTTCGCTGGC GGTCTGCTCT ATGTGGAACC GCTCTACGTG CAGGCCGGGG GCAGCGACGC CTCCTATCCG CTGCTGCAGC AGGTCCTGGT CGGCTTCGGC GACCAGGTCG CTATCGGCAG CAACCTCCAG GAGGCGCTGA ACAACCTCTT CGACGGGGAC GAGGCCCCCT TGGAGGAGCC CACCACAGAC GGAGAGGCCC GGGAGGAAGA GGAGCAGCCG CAGGCGAGCA GCGACCTCGC CCAGGCGCTG GAGGACGCCG CGGAAGCGTA CGAAGAGGGT CAGGCAGCGC TGCGGGAAGG CGACTTCGCC GCCTACGGCG AGGCCAACGA GCGGTTGAAG GAGGCCCTCG ACCGCGCGAA GGCCGCATCC GGAAGCAACG AGGAGAAGGA CGAGTAG
|
Protein sequence | MSFRSPGAPT ARMPRRSRLL APVGAAVVVI IAGIMFAANF WTEYRWFGSV GYTTVFWTEL RTRALLFAGG ALLMALAVGL SVYFAYRTRP AYRPFSLEQQ GLDRYRSSID PHRKVFFWGL VGGLALLTGA SATAEWQTFL QFANATKFGA QDAQFGLDIS FYTFTYPFLQ VIIGYLYTAV VIAFIAGVVV HYLYGGVRLQ AQGQRVTPAA RVHLSVLLGV FLLLRAADYW LEQYGLVFSN RGYTFGASYT DVNAVLYAKI ILFFIALVCA VLFFANIYFK NARVPLVSLG LMVLSAILIG GVYPAIVQQV TVSPNEQRLE RPYIQRNIEA TRAAYGIDGA EVIDYDAQTE LTTAELAAEA ETIPSVRLVD PAVVSQTFQQ LQQVRGFYQF PQVLEVDRYT TSDGETVDTI VAARELDGPP SQEDRWLTRH LVYTHGFGMV AAAGNQVDSE GRPVFLEYNI PPTGELSQVG EGYEPRIYFG REGAEYVIVN AEAEYDYPVD PDTPEVPTTE DAVVPTPSPS PEADEAPAPA DSREEGSAQE QQDQEGQDGG EDAQGTEEEQ SGQGSGQANN YYDGKGGVQL KSFFDKLMYA LKYQEINILL NNAISNESQI IYVRDPAERV EKVAPFLTVD GKAYPAVVDG RIVWIVDAYT TSDRYPYSTP IDLAQATTDT FTESTTAVNA LPGNRVNYIR NSVKATVDAY DGTVTLYGWD EEDPVLQTWS KAFPGVVTSK DEISDTLLSH LRYPDDLYKV QREILERYHI TNADAFYGGQ DFWTVPNDPK PQAGNNPEPP YRQTIRFPGD DTPTYSLTST FVPRGRENLA AFMAVNSDAS SEDYGQMRIL ELPRSTAVQG PGQIQNTFQS SAEVREVLLP LEQSSAQVTY GNLLTLPFAG GLLYVEPLYV QAGGSDASYP LLQQVLVGFG DQVAIGSNLQ EALNNLFDGD EAPLEEPTTD GEAREEEEQP QASSDLAQAL EDAAEAYEEG QAALREGDFA AYGEANERLK EALDRAKAAS GSNEEKDE
|
| |