Gene Strop_2098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2098 
Symbol 
ID5058561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2371774 
End bp2372778 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content71% 
IMG OID640474361 
Producttransketolase, central region 
Protein accessionYP_001158927 
Protein GI145594630 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0340316 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.234627 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGCA TGACCATGGC TAAGGCGCTC AACACCGCGC TCGCCGACGC GATGCTCGAC 
GACGACCGGG TGGTCGTCTT CGGTGAGGAC GTCGGCCAAC TCGGCGGGGT CTTCCGGATC
ACCGACGGGC TGGCGGCCCG CTTCGGCGAC AAGCGCTGCT TCGACACACC GCTCGCCGAG
GCTGGCATCG TCGGTTTCGC GGTTGGCCTC GCCATGTCCG GGCTGCGGCC GGTGGTGGAG
ATGCAGTTCG ACGCGTTTGG GTACCCGGCC TTCGAGCAGA TCGCCTCGCA CGTGGCGAAG
CTGCGCAACC GCACCCGTGG CGCGCTGAGC GCGCCCATCG TCATCCGGAT CCCGTACGCG
GGCGGCATCG GCGGGGTGGA GCACCACTGC GACTCCTCCG AGGCGTACTA CGCGCACACC
CCCGGTCTGA AGGTCGTCAC CCCGGCCACC GTGACCGATG CCTACTCGCT GCTGCGTGCG
GCGATCGACG ATCCGGACCC GGTCGTTTTC CTGGAGCCGA AGAAGCTCTA CTTCGCCAGC
GCCGAGACGC AGTTGCCAGC TCGGACCGAG CCGTTCGGCC GCGCCGTCGT ACGCCGTCGG
GGCACTGATG CCACCCTGGT CGCGTACGGG CCGGCGGTGC CGGTGGCCCT GGCAGCCGCC
GAGGCGGCCC AGGAGGAGGG CTGGAACCTC GAAGTCGTTG ACGTGCGGAC GATCGTACCG
TTCGACGACG GCACGATCGC GGCGTCGGTG CGAAAGACGG GCCGGTGCGT GGTGGTCCAG
GAGGCCCAGG GCTTCGCCGG AGTCGGCGCG GAGATCGCCG CGCGGGTGCA GGAGCGTTGC
TTCCACTCCC TACACGCGCC GGTGCTGCGG GTTGCCGGGC TGGACATCCC CTATCCGGCG
CCGATGCTGG AGCACACCCA CCTGCCGTCG GTGGATCGGG TGCTCGACGC GGTGGCCCGC
CTCCAGTGGG ACGACCAGCC CGACGAGCGA TGGGTGGCGG CCTGA
 
Protein sequence
MASMTMAKAL NTALADAMLD DDRVVVFGED VGQLGGVFRI TDGLAARFGD KRCFDTPLAE 
AGIVGFAVGL AMSGLRPVVE MQFDAFGYPA FEQIASHVAK LRNRTRGALS APIVIRIPYA
GGIGGVEHHC DSSEAYYAHT PGLKVVTPAT VTDAYSLLRA AIDDPDPVVF LEPKKLYFAS
AETQLPARTE PFGRAVVRRR GTDATLVAYG PAVPVALAAA EAAQEEGWNL EVVDVRTIVP
FDDGTIAASV RKTGRCVVVQ EAQGFAGVGA EIAARVQERC FHSLHAPVLR VAGLDIPYPA
PMLEHTHLPS VDRVLDAVAR LQWDDQPDER WVAA