Gene Tcur_4809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcur_4809 
Symbol 
ID8606171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermomonospora curvata DSM 43183 
KingdomBacteria 
Replicon accessionNC_013510 
Strand
Start bp5452205 
End bp5453479 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content71% 
IMG OID 
Productsulfotransferase 
Protein accessionYP_003302364 
Protein GI269128994 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCCCC CCGACGTCCA CATCGACGAT CTCGCCGAAC CGCGTTTCTC CCCCGAGGCC 
CAGCAGCTCA TCGACGGCCT GACGGCGATG GCGCCGGCCT GCACGCTGGA GCCCGAGGCG
CTGATGGAGA TGGCCGTCCA GCAGACCGAG GCGACCGGGC GGCGCATGGA GGACTTCGGC
GATGAGGCCT TCCGCGAGCC GCTGGAGATC CTGTGCCGGT CGCTGCGGGA GGAGGCCGGG
CTGGCCGAGC ACGGCAAGGT CGTCTGGCAC GCCCAGCTGC TGGCCCAGCT CACCCAGCGG
CTGCGGTTGC AGGACCTGCT GAACCGGCAC CCGGAAATCC ACGACGTGCA GATCGAGCGG
CCGATCATCA TCGCCGGGCT GCCCCGCACC GGCACCACCC ACCTGCACAA CCTGCTGTCG
GCCGACCCGG CGCTGCGCCC GCTGCCCTAC TGGGAGGCCT GCGAGCCCGT CCCGCCGCCC
GGCGAGGAGG GCACGATCGA GCCGCGCATC CAGCGGGTGG CGGCCTCGCT GCACCTGGTG
CACACCACGC TGCCGTACCT GAAGCGGATG TTCGACCTGA CGCCGACCTA CTCCCACGAA
GAGGGCGGCC TGCTGGCGCT GACGTTCGCC TCCACACACC TGGAGATCCA GGCGATGGTG
CCGTCCTATA GGGACTGGTA CCTGGGCACC GACCAGACGT TCGCCTACGA GTACCTGCGC
ACCGCGCTCA AGGCCATCAC CTGGCTGCGG GGCGGCGGGC GCTGGGTGCT CAAGGCCCCC
CAGCACCTGG AGCAGCTCGG CCCGCTGATG AAGGTGTTCC CGGACGCCAC CGTGGTGATC
ACCCACCGCG ACCCGGTGGC GGTCACCGCG TCGCTGACCA CCATGCTGTG CTACGGGCTG
CGCATGACGA CCTACCCGAT CGACCCGCAC GCCGTAGGCG CCTACTGGCG CGACCGGTCG
GCCATCTACA TGGAACGCTG CCTGCGCGAC CGCGACCTGG TGCCCAAGGA GCAGTCGATC
GACGTGCTGT TCCACGAGTT CATGGCCGAC GACATCGCCA TGGTGGAACG CATCTACCAG
GTGGCCGGCC AGCCGTTCAC CGAGGAGACC CGGGCGGCGA TGGAGGCGTA CATGGCCGAG
CATCCCCGGG GCCGGCACGG GCGGGTGGAC TACCGGCTGG CGGACATCGG GCTGGAGCTG
GCCGAACGGC GGCGGGCGCT GGCCCCCTAC GCCGAGCGCT TCGGCACGAA GGAGGAGCCG
GTCAAGGAGC GCTGA
 
Protein sequence
MRPPDVHIDD LAEPRFSPEA QQLIDGLTAM APACTLEPEA LMEMAVQQTE ATGRRMEDFG 
DEAFREPLEI LCRSLREEAG LAEHGKVVWH AQLLAQLTQR LRLQDLLNRH PEIHDVQIER
PIIIAGLPRT GTTHLHNLLS ADPALRPLPY WEACEPVPPP GEEGTIEPRI QRVAASLHLV
HTTLPYLKRM FDLTPTYSHE EGGLLALTFA STHLEIQAMV PSYRDWYLGT DQTFAYEYLR
TALKAITWLR GGGRWVLKAP QHLEQLGPLM KVFPDATVVI THRDPVAVTA SLTTMLCYGL
RMTTYPIDPH AVGAYWRDRS AIYMERCLRD RDLVPKEQSI DVLFHEFMAD DIAMVERIYQ
VAGQPFTEET RAAMEAYMAE HPRGRHGRVD YRLADIGLEL AERRRALAPY AERFGTKEEP
VKER