Gene Tfu_1353 details


Gene Information

Locus tag: Tfu_1353
Symbol:
ID: 3580367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermobifida fusca YX
Kingdom: Bacteria
Replicon accession: NC_007333
Strand:
Start bp: 1569342
End bp: 1570757
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 66%
IMG OID: 637685050
Product: n-acetylgalactosamine-6-sulfate sulfatase
Protein accession: YP_289414
Protein GI: 72161757
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
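
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the terminal stop codon is counted in the gene length but not in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon is not translated


# Values copied from the Gene Information table for Tfu_1353.
START_BP = 1569342
END_BP = 1570757

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the result agrees with the listed Gene Length (1416 bp) and Protein Length (471 aa).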


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0725989
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCTGGAA ACAGGGAACG CGCGTTCGAA AGCACTCGCC AGGGACGGCC CGACCAACCC 
CCCAACATCC TGTTCATCCT CGCCGACGAC CTCGGCTGGG CCGACCTGGG CTGCTACGGT
TCCACCACTA TCCGCACTCC CAACCTCGAC CAGCTGGCCG CCCAAGGCAT CCGGTTCACC
CACGGCTATG CGGGATCACC TTGGTGCTCC TCCACCAGGA TCAGTCTGTA CACGGGGCGC
TACCCGGGAC GCTTGCAAGC CGGCCTCGAA GAACCGCTCG TCACCCGTTC CCCGGAAAAC
GGCATCCCGG AAGGCCACCC CACCCTTTCC TCACTTCTCG TTGAGGCCGG GTACGCCACC
GCCATGTTCG GTAAATGGCA CTGCGGCTGG CTGCCGTGGT ACAGCCCGTT GCGGATCGGC
TTCGAAACCT TCTTCGGGAA CTTCGACGGG GCCCTCGACT ACTTCGAACA CGTCGACACT
CTTGGCAAAG CCGACCTGTA CGAAGGGGAG ACCCCGGTTG AGGAAGTCGG CTACTACACG
GAGATCATCT CCGAGCGGGC CGCTGAATAC ATCACCGCGC ACCGCAACCG GCCGTTCTAC
GTGCAACTCA ACTACACCGC GCCCCACTGG CCGTGGGAAG GACCCGACGA CCACGAAGTC
GGACAAGAAA TCCGCCGCCG CTACCAGCAG AGATGGGAGC ACTCCCCGCT GATGCACCTG
GACGGCGGTT CGATCGCTAA ATACGGCGAA CTTGTGGAAG CCATGGACGC GGGCATCGGA
CAGGTCCTGG CCGCTCTCGA CAGGGCCGGG GCCGCCGACA ACACTATCGT CGTTTTCTCC
TCCGACAACG GTGGGGAACG CTGGTCGAAA AACTGGCCGT TCGTGGGGGA GAAAGGCGAC
CTCACCGAAG GCGGCATCCG TGTCCCGCTC ATCGTGGCCT GGCCGGAAGC GATAGCCGGA
AACCAGGTGA GCGACCATCC CGTCATCACC ATGGACTGGA CAGCGACCCT GCTCGCCGCC
GCGGGAACCG AACCGCACCC CGACTGGCCG CTCGACGGTG TGGACCTGCT GCCGTGGCTG
GTCGACGGCG CCGACTTTCC CGCCCACGAC CTGTTCTGGC GCACCTCCAA CCAGGGGGCG
CTGCGGCGAG GCCGGTTCAA ATACCTGCGT GACCGGCGGG ACCGCGCCGT GCTCGGCAAC
TGGCCGCGCC ACTACGGCGA CTACCACCTG CTCTACGACG TGACCGTGGA CGGCCGGGAA
CGAGCTGATA TTGCCGGACA GCATCCCGAA GTGCTCGCCG AACTGCGGGA AGCCTGGGAG
CGGATCGACG CGGAACTGCT GCCCTTCCCG ACCACCCACG TGGGCCTGCC GCGGCCCCGC
ACCGAAGGAG CCCCGGCCGT GAGCGAGCCG GACTGA
 
Protein sequence
MPGNRERAFE STRQGRPDQP PNILFILADD LGWADLGCYG STTIRTPNLD QLAAQGIRFT 
HGYAGSPWCS STRISLYTGR YPGRLQAGLE EPLVTRSPEN GIPEGHPTLS SLLVEAGYAT
AMFGKWHCGW LPWYSPLRIG FETFFGNFDG ALDYFEHVDT LGKADLYEGE TPVEEVGYYT
EIISERAAEY ITAHRNRPFY VQLNYTAPHW PWEGPDDHEV GQEIRRRYQQ RWEHSPLMHL
DGGSIAKYGE LVEAMDAGIG QVLAALDRAG AADNTIVVFS SDNGGERWSK NWPFVGEKGD
LTEGGIRVPL IVAWPEAIAG NQVSDHPVIT MDWTATLLAA AGTEPHPDWP LDGVDLLPWL
VDGADFPAHD LFWRTSNQGA LRRGRFKYLR DRRDRAVLGN WPRHYGDYHL LYDVTVDGRE
RADIAGQHPE VLAELREAWE RIDAELLPFP TTHVGLPRPR TEGAPAVSEP D
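
The gene sequence begins with GTG, yet the protein begins with M. This is expected under translation table 11, where GTG can act as an initiator codon and is then read as methionine. The following is a minimal sketch of that translation, not the pipeline used to produce this record. It builds the codon table from the standard NCBI amino-acid string (shared by tables 1 and 11), and it treats only ATG, GTG and TTG as start codons, which is a simplifying assumption. The helper names `translate_cds` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); identical for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplifying assumption: only these initiator codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading an initiator codon as M and stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above (60 bases, 20 codons).
FIRST_LINE = "GTGCCTGGAA ACAGGGAACG CGCGTTCGAA AGCACTCGCC AGGGACGGCC CGACCAACCC"
print(translate_cds(FIRST_LINE))  # MPGNRERAFESTRQGRPDQP
```

The printed fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the 66% GC content listed in the Gene Information table.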