Gene Tcur_0054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcur_0054 
Symbol 
ID8601346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermomonospora curvata DSM 43183 
KingdomBacteria 
Replicon accessionNC_013510 
Strand
Start bp55918 
End bp57303 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content71% 
IMG OID 
Productsulfatase 
Protein accessionYP_003297700 
Protein GI269124330 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones71 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCGGGA CGTTCCCGTT GACGCGGCGC AGGGTGCTGC TGTCCGGGGC GGTGGCCGCC 
ACGGTGGCGG GGATCTGCGG CGGGACGTCC CGTCCGGTGT CGGGCCGGGG GCGGCCCAAC
GTGCTGCTGC TGGTCACCGA CGATCAGCCG TTGCACACCG AGTGGGCCAT GCCGGTTCTG
CGCGACATGA TCAAACGGAG CGGTGTGCGT TTCACCCGCG CCTATGCCAC GACGCCGCTG
TGCGGGCCGT CCCGTGCCTC GATCCTGTCG GGGCGGTACG CCCATAACCA CGGGGTGCTG
CAGAACGGCC GTCCCGAGCG GCTGGATCAG AGCACCGTCC TGCCCCGCTA CCTGCGGGAG
GCCGGGTACC GCACCGCGAT GTTCGGCAAG TACCTCAACG GCTGGGACGT CCACCAGGCT
CCGCCGCACT TTGACGAGTA CGCCCTCATG CACCCGCCCA AGTACGGCGA GACCTGGTGG
AACGTCAACG GGAAGGTGAG CAGGAAGCGC GCCTACAGCA CCTCCCTCAT CAGGGACCAC
GCCGTGCGGT TCCTGCGGCG GCACCGCGCG AGCGGACGCC CGTGGTTCCT GTACCTGACT
CCCTACGCCC CCCACGCGCC GTTCACCCCT GAGGCGCGGT ATGCGAATCT GAGCGTCCCT
TCATGGCGCG GCAACCCCGC CGTCGCCGAG TCGGATAAGC GGGACAAGCC GTTCTATATT
CGGCGGTCCG ACCCCGATCT TCACCGTGCC CGCCGCATCC GCGCCGGTCA GCTGCGCACC
TTACGCTCCG TGGACGACCT GCTCGGCGCG GTGCGGGACG AGCTGCGCGC CCAGCGCCGG
CTGGACGACA CGCTCATCAT CGTCATCAGC GACAACGGCT ACTGCTGGGG GGATCACGGC
TGGCACGCCA AGAGCGTCCC CTACTCCCCC GCGGTCCGCA TCCCGCTGTA CCTGTCGTGG
CCGGCCGGCG GGCTCGGCAG GGGCGCCACC GACGACCGGC TGGTGGCCAA CATCGACATC
ATGCCCACGA TCTTGGACGC GGCGGGCATC GATCCCGGCG CCGCGAGACT GGACGGCCGG
TCGCTGCTGC GTCCCGGGGA ACGCGACCGG CTGCTGTTGG AATGGTGGAA GAGGGGCCCG
GGGCAGGCCG GGCATAGCTG GGCGGCCACG GTCACGCGGG ACTACCAGTA CATCGAGCAC
TACGACACCA TCTTGCGCCG GGGCAGGCCG GTGGGGTCGG GAGCGGTGGT GCATCGCGAG
TACTACGACT TGCGCAAGGA CCCGCACCAG CTCACCAACC TGCTGCACCG CACGGGGTCC
GGCGTGGCGC GGCGGCTGGA CGTGGCGGGT CTGTCCGCCC GCCTGGCGGC CGACCGGAGG
GCCTGA
 
Protein sequence
MSGTFPLTRR RVLLSGAVAA TVAGICGGTS RPVSGRGRPN VLLLVTDDQP LHTEWAMPVL 
RDMIKRSGVR FTRAYATTPL CGPSRASILS GRYAHNHGVL QNGRPERLDQ STVLPRYLRE
AGYRTAMFGK YLNGWDVHQA PPHFDEYALM HPPKYGETWW NVNGKVSRKR AYSTSLIRDH
AVRFLRRHRA SGRPWFLYLT PYAPHAPFTP EARYANLSVP SWRGNPAVAE SDKRDKPFYI
RRSDPDLHRA RRIRAGQLRT LRSVDDLLGA VRDELRAQRR LDDTLIIVIS DNGYCWGDHG
WHAKSVPYSP AVRIPLYLSW PAGGLGRGAT DDRLVANIDI MPTILDAAGI DPGAARLDGR
SLLRPGERDR LLLEWWKRGP GQAGHSWAAT VTRDYQYIEH YDTILRRGRP VGSGAVVHRE
YYDLRKDPHQ LTNLLHRTGS GVARRLDVAG LSARLAADRR A