Gene Information |
Locus tag | Tfu_1353 |
Symbol | |
ID | 3580367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | + |
Start bp | 1569342 |
End bp | 1570757 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637685050 |
Product | N-acetylgalactosamine-6-sulfate sulfatase |
Protein accession | YP_289414 |
Protein GI | 72161757 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
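
The coordinate and length fields in this table are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (a convention the record does not state explicitly) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(1569342, 1570757)
print(length, protein_length(length))  # 1416 471
```

Under those assumptions the computed values agree with the listed Gene Length (1416 bp) and Protein Length (471 aa).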
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0725989 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCTGGAA ACAGGGAACG CGCGTTCGAA AGCACTCGCC AGGGACGGCC CGACCAACCC CCCAACATCC TGTTCATCCT CGCCGACGAC CTCGGCTGGG CCGACCTGGG CTGCTACGGT TCCACCACTA TCCGCACTCC CAACCTCGAC CAGCTGGCCG CCCAAGGCAT CCGGTTCACC CACGGCTATG CGGGATCACC TTGGTGCTCC TCCACCAGGA TCAGTCTGTA CACGGGGCGC TACCCGGGAC GCTTGCAAGC CGGCCTCGAA GAACCGCTCG TCACCCGTTC CCCGGAAAAC GGCATCCCGG AAGGCCACCC CACCCTTTCC TCACTTCTCG TTGAGGCCGG GTACGCCACC GCCATGTTCG GTAAATGGCA CTGCGGCTGG CTGCCGTGGT ACAGCCCGTT GCGGATCGGC TTCGAAACCT TCTTCGGGAA CTTCGACGGG GCCCTCGACT ACTTCGAACA CGTCGACACT CTTGGCAAAG CCGACCTGTA CGAAGGGGAG ACCCCGGTTG AGGAAGTCGG CTACTACACG GAGATCATCT CCGAGCGGGC CGCTGAATAC ATCACCGCGC ACCGCAACCG GCCGTTCTAC GTGCAACTCA ACTACACCGC GCCCCACTGG CCGTGGGAAG GACCCGACGA CCACGAAGTC GGACAAGAAA TCCGCCGCCG CTACCAGCAG AGATGGGAGC ACTCCCCGCT GATGCACCTG GACGGCGGTT CGATCGCTAA ATACGGCGAA CTTGTGGAAG CCATGGACGC GGGCATCGGA CAGGTCCTGG CCGCTCTCGA CAGGGCCGGG GCCGCCGACA ACACTATCGT CGTTTTCTCC TCCGACAACG GTGGGGAACG CTGGTCGAAA AACTGGCCGT TCGTGGGGGA GAAAGGCGAC CTCACCGAAG GCGGCATCCG TGTCCCGCTC ATCGTGGCCT GGCCGGAAGC GATAGCCGGA AACCAGGTGA GCGACCATCC CGTCATCACC ATGGACTGGA CAGCGACCCT GCTCGCCGCC GCGGGAACCG AACCGCACCC CGACTGGCCG CTCGACGGTG TGGACCTGCT GCCGTGGCTG GTCGACGGCG CCGACTTTCC CGCCCACGAC CTGTTCTGGC GCACCTCCAA CCAGGGGGCG CTGCGGCGAG GCCGGTTCAA ATACCTGCGT GACCGGCGGG ACCGCGCCGT GCTCGGCAAC TGGCCGCGCC ACTACGGCGA CTACCACCTG CTCTACGACG TGACCGTGGA CGGCCGGGAA CGAGCTGATA TTGCCGGACA GCATCCCGAA GTGCTCGCCG AACTGCGGGA AGCCTGGGAG CGGATCGACG CGGAACTGCT GCCCTTCCCG ACCACCCACG TGGGCCTGCC GCGGCCCCGC ACCGAAGGAG CCCCGGCCGT GAGCGAGCCG GACTGA
|
Protein sequence | MPGNRERAFE STRQGRPDQP PNILFILADD LGWADLGCYG STTIRTPNLD QLAAQGIRFT HGYAGSPWCS STRISLYTGR YPGRLQAGLE EPLVTRSPEN GIPEGHPTLS SLLVEAGYAT AMFGKWHCGW LPWYSPLRIG FETFFGNFDG ALDYFEHVDT LGKADLYEGE TPVEEVGYYT EIISERAAEY ITAHRNRPFY VQLNYTAPHW PWEGPDDHEV GQEIRRRYQQ RWEHSPLMHL DGGSIAKYGE LVEAMDAGIG QVLAALDRAG AADNTIVVFS SDNGGERWSK NWPFVGEKGD LTEGGIRVPL IVAWPEAIAG NQVSDHPVIT MDWTATLLAA AGTEPHPDWP LDGVDLLPWL VDGADFPAHD LFWRTSNQGA LRRGRFKYLR DRRDRAVLGN WPRHYGDYHL LYDVTVDGRE RADIAGQHPE VLAELREAWE RIDAELLPFP TTHVGLPRPR TEGAPAVSEP D
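
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) allows for an initiation codon. The sketch below illustrates that relationship and a simple GC-content calculation. It is a hedged example rather than the pipeline that produced this record: the helper names are hypothetical, the codon table is built by hand instead of using a library, and the input is assumed to contain only A, C, G and T plus the whitespace used to group bases in blocks of ten.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments in NCBI tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Initiation codons permitted by table 11; each is read as Met at position 0.
STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        # An alternative start codon is read as M only at the first position.
        aa = "M" if i == 0 and codon in STARTS else CODONS[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGCCTGGAA ACAGGGAACG CGCGTTCGAA"))  # MPGNRERAFE
```

Passing the full Gene sequence field to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content rows.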