Gene Strop_3089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3089 
Symbol 
ID5059553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp3539652 
End bp3541877 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content70% 
IMG OID640475339 
Producttransketolase 
Protein accessionYP_001159904 
Protein GI145595607 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.37552 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGAT CCCCGGGCGG ACGTTTAGGA CCGCTCGATA AGGTCACCGG TGAGGGTTCC 
GCCCATCTGT CGAGGAGCAC AACCAACGTG GCTGACAACC GACCCGAGCT TCCTGCACTC
AACTGGTCCG ACCTCGACCG TCAGGCGGTC GACACGGTCC GCGTGCTGGC CATGGACGCC
GTCGAGAAAT CCGGCAACGG CCACCCCGGC ACCGCGATGA GCCTGGCGCC CGCGGCCTAC
CTACTCTTCA ACCGGGTCAT GCGACATAAC CCCGCCGACC CGACCTGGCC CGGCCGCGAC
CGCTTCGTGC TCTCCGCCGG GCACTCCAGC CTCAGCCTCT ACATCCAGCT CTACTTCTCC
GGCTATCCGA TGAGCCTCAC CGACCTCGAG GGGCTGCGGC AGTGGGGCTC GCTCACCCCC
GGTCACCCCG AGTACGGGCA CACCCCGGGG GTGGAGACCA CCACCGGCCC GCTCGGACAG
GGCCTGGGCA ACGCGGTCGG GATGGCGATG GCCGCCCGCC GTGAACGCGG GCTGTTCGAC
CCCGAGCCGA AGCTCGGCAC CTCGGTCTTC GACCACGACA TCTGGTGTAT CGCCTCCGAC
GGCGACATCG AGGAGGGGGT CAGCCACGAG GTCAGCGCCC TCGCCGGCCA CCAGCAGCTG
GGCAATCTCT GCGTGATCTA CGACGACAAC GAGATCTCGA TCGAGGACGA CACCCGGATC
GCCAAGAGCG AGGACGTCGC GGCCCGCTAC CGGGCGTACG GGTGGCACGT GCAGACGGTC
GACTGGCGGC GCGGCGACGC CGACCAGGAC GACTACCACG AGGACGTGGC CACGCTGTAC
CGGGCGCTGC TGGCCGCCAA GGCCGAGACC GGTCGGCCGT CGTTCATCGC CCTGCGCACC
ATCATCGGCT GGCCGGCGCC GAACAAACGG AACACCGGCA AGATCCACGG TTCGGCGCTC
GGTGCCGACG AGGTCGCGGC GACCAAGGAA CTCCTCGGCT TCGACCCGCA GCGCACCTTC
GAGGTCGACG AGGGCGTCCT CAAGCACACC CGCCAGGTGC GCGAACGCGG CATCGCCGCC
CAGCGTGAGT GGACCGAGAC CTTCGAGGCC TGGGGGCAGG CGAACCCGGA ACGCAAGGAG
CTCTGGGACC GAATGGCCAC CCGGACGCTG CCACGAGGCT GGACGGACGC CCTACCCGCG
TTCCCCGCCG ACGCCAAGGG CATCGCCACC CGCGCCGCCT CCGGCACGGT CCTCGGCGCG
CTCGCACCGG TGCTGCCGGA GCTGTGGGGC GGCTCGGCAG ACCTGGCGGA CAGCAACAAC
ACCACCATGA AGGGCGAGCC GTCCTTCATC CCGGCCGAGC ATGCCACCAA GGACTTCCCG
GGCAACGAGT ACGGCCGCAC GCTGCACTTC GGCGTCCGCG AACACGCGAT GGGCGCCATC
CTCAACGGGA TCGCTCTGCA CGGTGGCACC CGCCCGTACG GCGGTACGTT CCTCGTCTTC
AGCGACTACA TGCGCCCGTC GGTACGCCTC GCGGCGATGA TGAAGCTGCC GGTGACCTAC
GTCTGGACAC ACGACTCGAT CGGCCTCGGC GAGGACGGTC CGACCCACCA ACCGGTGGAG
CAGCTGACCT CGCTGCGAGC AATCCCCGGA CTGGATGTGG TACGTCCCGC CGACGCGAAC
GAGACCGCGT GGGCCTGGCG GCAGATCCTG ACGCATACCG ATCGGCCGGC CGCGTTGGCA
CTGAGCCGCC AGCCGTTGCC GACCCTGGAT CGATCCGTGC TCACCAGCGC GGAAGGGGTG
GCCCGCGGCG GGTACGTGCT GGTCGACGCT GTTGGCGGCA AGCCGCAGGT GATCCTTCTC
GCCACCGGTT CGGAGGTGCA GCTCTGCCTC ACCGCCCGGG AGCGGCTGGA GGCCGACGGC
ACCCCCACCC GGGTCGTCTC CATGCCCTGC CAGGAGTGGT TCCGGGCTCA GGATGAGGCG
TACCAGGAGT CGGTCCTACC CCGTGGGGTA AAGGCACGGG TGAGCGTGGA GGCGGGCGTC
GCGATGTCCT GGCGGGCCTT CGTCGGTGAC TGCGGCGAGA GCATCAGCCT GGAGCACTTC
GGGGCGAGCG CCCCGCACAC CGTGCTCTTC GAGCAGTTCG GTTTCACCCC GGACCGGATC
GTGGGTGCGG CGCACGCCGC GCTGACCCGG GTCGGCGACA TCACCGGTAA TCCGACCGGC
AACTGA
 
Protein sequence
MHGSPGGRLG PLDKVTGEGS AHLSRSTTNV ADNRPELPAL NWSDLDRQAV DTVRVLAMDA 
VEKSGNGHPG TAMSLAPAAY LLFNRVMRHN PADPTWPGRD RFVLSAGHSS LSLYIQLYFS
GYPMSLTDLE GLRQWGSLTP GHPEYGHTPG VETTTGPLGQ GLGNAVGMAM AARRERGLFD
PEPKLGTSVF DHDIWCIASD GDIEEGVSHE VSALAGHQQL GNLCVIYDDN EISIEDDTRI
AKSEDVAARY RAYGWHVQTV DWRRGDADQD DYHEDVATLY RALLAAKAET GRPSFIALRT
IIGWPAPNKR NTGKIHGSAL GADEVAATKE LLGFDPQRTF EVDEGVLKHT RQVRERGIAA
QREWTETFEA WGQANPERKE LWDRMATRTL PRGWTDALPA FPADAKGIAT RAASGTVLGA
LAPVLPELWG GSADLADSNN TTMKGEPSFI PAEHATKDFP GNEYGRTLHF GVREHAMGAI
LNGIALHGGT RPYGGTFLVF SDYMRPSVRL AAMMKLPVTY VWTHDSIGLG EDGPTHQPVE
QLTSLRAIPG LDVVRPADAN ETAWAWRQIL THTDRPAALA LSRQPLPTLD RSVLTSAEGV
ARGGYVLVDA VGGKPQVILL ATGSEVQLCL TARERLEADG TPTRVVSMPC QEWFRAQDEA
YQESVLPRGV KARVSVEAGV AMSWRAFVGD CGESISLEHF GASAPHTVLF EQFGFTPDRI
VGAAHAALTR VGDITGNPTG N